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SYNTHETIC DNA SEQUENCE HAVING ENHANCED 
INSBCTIC7DAL ACTIVITY IN MAIZE 

This application is a continuation in parr application 
of U.S. serial no. 772,027 filed October 4, 1991, which 
disclosure is herein incorporated in its entirety. 

Field of the Invention 

The present invention relates to DNA sequences encoding 
insect icidal proteins, and expression of these sequences in 
plants, 

3ac*cround of the Invention 
Expression of the insecticidal protein (IP) genes 
derived from Bacillus thurinaiensis (Bt) in plants has proven 
extremely difficult. Attempts have been made to express 
chimeric promoter/Bt IP gene combinations in plants. 
Typically, only low levels of protein have been obtained in 
transgenic plants. See , for example, Vaeck et al., Nature 
328:33-37, 1987; Barton et al., Plant Physiol. 85:1103-1109, 
1987; Fischoff et al., Bio/ Techno loorv 5:807-813, 1987. 

One postulated explanation for the cause of low 
expression is that fortuitious transcription processing sites 
produce aberrant forms of 3t IP mRNA transcript. These 
aberrantly processed transcripts are non-functional in a plant, 
in terms of producing an insecticidal protein. Possible 
processing sites include polyadenylation sites, intron splicing 
sites, transcriptional termination signals and transport 
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signals. Mosr genes do not contain sites that will 
deleteriously affect gene expression in chat gene's normal host 
organism. However, the fortuitous occurrence of such 
processing sites in a coding region might complicate the 
expression of that gene in transgenic hosts. ?or example, the 
coding region for the 3t insecticidal crystal protein gene 
derived from 3acillus r.hurinaiensis strain fcurstaki (GENBANX 
3THK0RHD, accession M1S271, B^. tlmrinoiensis var. fcurstaJci, 
HD-I; Geiser et al. Gene 48:109-118 (1986)) as derived directly 
from Bacillus thurinqiensis , might contain sites which prevent 
this gene from being properly processed in plants. 

Further difficulties exist when attempting to express 
3acillus thurinaiensis protein in an organism such as a plant. 
It has been discovered that the codon usage of a native 3t IP 
gene is significantly different from that which is typical of a 
plant gene. In particular, the codon usage of a native 3t IP 
gene is very different from that of a maize gene. As a result, 
the mRNA from this gene may not be efficiently utilized. Codon 
usage might influence the expression of genes at the level of 
translation or transcription or mRNA. processing. To optimize 
an insecticidal gene for expression in plants, attempts have 
been made to alter the gene to resemble, as much as possible, 
genes naturally contained within the host plant to be 
transformed. 

Adang et al., EP 0359472 (1990) , relates to a synthetic 
3acillus thurinaiensis tenebrionis (Btt) gene which is 35% 
homologous to the native Btt gene and which is designed to have 
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an A+T content approximating that found in plants in general. 
Table i of Adang et al. show the codon sequence of a synthetic 
3tt gene which was mad to resemble more closely the normal 
codon distribution of dicot genes. Adang et al. state that a 
synthetic gene coding for IP can be optimised for enhanced 
expression in monocot plants through similar methods , 
presenting the frequency of codon usage of highly expressed 
monocot proteins in Table 1. At page 9, Adang et al. state 
that the synthetic Btt gene is designed to have an A+T content 
of 55% (and, by implication, a G+C content of 45%) . At page 
20, Adang et al. disclose that the synthetic gene is designed 
by altering individual amino acid codons in the native at gene 
to reflect the overall distribution of codons preferred by 
dicot genes for each amino acid within the coding region of the 
gene. Adang et al. further state that only some of the native 
Btt gene codons will be replaced by the most preferred plant 
codon for each amino acid, such that the overall aistribution 
of codons used in dicot proteins is preserved. 

Fischhoff et al., SP 0 385 962 (1990), relates to plant 
genes encoding the crystal protein toxin of Bacillus 
thuringiensis . At table V, Fischhoff et al. disclose percent 
usages for codons for each amino acid. At page 8, Fischoff et 
al . suggest modifying a native Bt gene by removal of putative 
polyadenylation signals and ATTTA sequences. Fischoff et al. 
further suggest scanning the native Bt gene sequence for 
regions with greater than four consecutive adenine or thymine 
nucleotides to identify putative, plant polyadenylation signals. 
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Fischoff et al. state that the nucleotide sequ nee should be 
altered if more than one putative polyadenylation signal is 
identified within ten nucleotides of each other. At page 9, 
Fischoff et al. state that efforts should be made to select 
codons to preferably adjust the G+C content to about 50%. 

Perlalc et al., PNAS USA , 88:3324-3328 (1991), relates 
to modified coding sequences of the Bacillus thurinaiensis 
crylA(b) gene, similar to those shown in Fischoff et al. As 
shown in table 1 at page 3325, the partially modified crylA(b) 
gene of Perlalc et al. is approximately 96% homologous to the 
native crylA(b) gene (1681 of 1743 nucleotides) , with a G+C 
content of 41%, number of plant polyadenylation signal 
sequences (PPSS) reduced from 18 to 7 and number of AITTA 
sequences reduced from 13 to 7. The fully modified crylA(b) 
gene of Periak et al. is disclosed to be fully synthetic (page 
3325, column 1) . This gene is approximately 79% homologous to 
the native crylA(b) gene (1455 of 1845 nucleotides) , with a G+C 
content of 49%, number of plant polyadenylation signal 
sequences (PPSS) reduced to 1 and all ATTTA sequences removed. 

Barton et al., BP 0431 829 (1991), relates to the 
expression of insecticidal toxins in plants. At column 10 , 
Barton et al. describe the construction of a synthetic AalT 
insect toxin , gene encoding a scorpion toxin using the most 
preferred codon for each amino acid according to the chart 
shown in Figure 1 of the document. 

Summary of the Invention 
The present invention is drawn to methods for enhancing 
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expression of heterologous genes in plant cells. Generally, a 
gene or coding region of interest is constructed to provide a 
plant specific preferred codon sequence. In this manner , codon 
usage for a particular protein is altered to increase 
expression in a particular plant. Such plant optimized coding 
sequences can be operably linked to promoters capable of 
directing expression of the coding seqence in a plant cell. 

Specifically, it is one of the objects of the present 
invention to provide synthetic insecticidal protein genes which 
have been optimized for expression in plants. 

It is another object of the present invention to 
provide synthetic Bt insecticidal protein genes to maximize the 
expression of at proteins in a plant, preferably in a maize 
plant. It is one feature of the present invention that a 
synthetic at IP gene is constructed using the most preferred 
maize codons, except for alterations necessary to provide 
ligation sites for construction of the full synthetic 
gene. 

According to the above objects, we have synthesized Bt 
insecticidal crystal protein genes in which the codon usage has 
been altered in order to increase expression in plants, 
particularly maize. However, rather than alter the codon usage 
to resemble a maize gene in terms of overall codon 
distribution, we have optimized the codon usage by using the 
codons which are most preferred in maize (maize preferred 
codons) in the synthesis of the synthetic gene. The optimized 
maize preferred codon usage is effective for expression of high 
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levels of the at insecticidal protein. This might be the 
result of maximizing the amount of 3t insecticidal protein 
translated from a given population of messenger RNAs. The 
synthesis of a Bt IF gene using maize preferred codons also 
tends to eliminate fortuitous processing sites that might occur 
in the native coding sequence. The expression of this 
synthetic gene is significantly higher in maize cells than that 

of the native IP Bt gene. 

Preferred synthetic, maize optimized DNA sequences of 
the present invention derive from the protein encoded by the 
crylA(b) gene in Bacillus fchnrinoianaia var. icurstaki, HD-1; 
Geiser et al., Gene, 48:109-118 (1986) or the crylB gene (AKA 
Crya4 gene) described by 3rizzard and Whiteley, Nuc. Acids. 
aes. , 16:2723 (1988) . The DNA sequence of the native kurstaki 
HD-1 crylA(b) gene is shown as Sequence 1. These proteins are 
active against various lepidopteran insects, including Ostrinia 
nubilalis , the European Corn Borer. 

While the present invention has been exemplified by the 
synthesis of maize optimized Bt protein genes, it is recognized 
that the method can be utilized to optimize expression of any 

protein in plants. 

The instant optimized genes can be fused with a variety 
of promoters, including constitutive, inducible, temporally 
regulated, developmentaliy regulated, tissue-preferred and 
tissue-specific promoters to prepare recombinant DNA molecules, 
i.e., chimeric genes. The maize optimized gene (coding 
sequence) provides substantially higher levels of expression in 
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a transformed plant, when compared with a non -maize optimized 
gene. Accordingly, plants resistant to Coleopteran or 
Lepidopteran pests , such as European com borer and sugarcane 
borer, can be produced. 

It is another object of the present invention to 
provide tissue-preferred and tissue-specific promoters which 
drive the expression of an operatively associated structural 
gene of interest in a specific part or parts of a plant to the 
substantial exclusion of other parts • 

It is another object of the present invention to 
provide pith-preferred promoters . By n pith-pref erred, " it is 
intended that the promoter is capable of directing the 
expression of an operatively associated structural gene in 
greater abundance in the pith of a plant than in the roots f 
outer sheath, and brace roots, and with substantially no 
expression in seed. 

It is yet another object of this invention to provide 
pollen-specific promoters. By "pollen-specif ic r n it is 
intended that the promoter is capable of directing the 
expression of an operatively associated structural gene of 
interest substantially exclusively in the pollen of a plant, 
with negligible expression in any other plant part. By 
"negligible, n it is meant functionally insignificant. 

It is yet another object of the present invention to 
provide recombinant DNA molecules comprising a tissue-preferred 
promoter or tissue-specific promoter operably associated or 
linked to a structural gene of interest, particularly a 
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structural gene encoding an insecticidal protein, and 
expression of the recombinant molecule in a plant. 

It is a further object of the present invention to 
provide transgenic plants which express at least one structural 
gene of interest operatively in a tissue-preferred or 
tissue-specific expression pattern. 

In one specific embodiment of the invention disclosed 
and claimed herein, the tissue-preferred or tissue-specific 
promoter is operably linked to a structural gene encoding an 
insecticidal protein, and a plant is stably transformed with at 
least one such recombinant molecule. The resultant plant will 
be resistant to particular insects which feed on those parts of 
the plant in which the gene(s) is (are) expressed. Preferred 
structural genes encode B.t. insecticidal proteins. More 
preferred are maize optimized B.t. IP genes. 

Brief Description of the Figures 
Fig. 1 is a comparison of the full-length native Bt 
cry IS. (b) gene [BTHKURHD] , a full-length synthetic maize 
optimized Bt crylA(b) gene [flsynbt.fin] and a truncated 
synthetic maize optimized Bt crylA(b) gene [bssyn] . This 
figure shows that the full-length synthetic maize optimized 
cry IX (b) gene sequence matches that of the native crylA(b) gene 
at about 2354 out of 3468 nucleotides (approximately 68% 
homology) . 

Fig. 2 is a comparison of the truncated native Bt 
cryia.(b) gene [BTHKURHD] and a truncated synthetic maize 

8 
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optimized 3t gene [bssyn] . This figure shows that the 
truncated synthetic maize optimized crylA(b) gene sequence 
matches that of the native crylA(b) gene at about 1278 out of 
1947 nucleotides (approximately 66% homology) . 

Fig. 3 is a comparison of the pure maize optimized 3t 
gene sequence [synlT.mze] with a truncated synthetic maize 
optimized 3t gene [bssyn] and a full-length synthetic maize 
optimized 3t gene modified to include restriction sites for 
facilitating construction of the gene [synful.mod] . This 
figure shows that the truncated synthetic maize optimized 
crylA(b) gene sequence matches that of the pure maize optimized 
crylA(b) gene at 1913 out of 1947 nucleotides (approximately 
98% homology) . 

Fig. 4 is a comparison of a native truncated Bt 
crylA(b) gene [BTHKDRHD] with a truncated synthetic crylA(b) 
gene described in Perlak et al., PNAS USA , 88:3324-3328 (1991) 
[PMONBT] and a truncated synthetic maize optimized Bt gene 
[bssyn] . This figure shows that the PMONBT gene sequence 
matches that of the native crylA(b) gene at about 1453 out of 
1845 nucleotides (approximately 79% homology) , while the 
truncated synthetic maize optimized 3t crylA(b) gene matches 
the native crylA(b) gene at about 1209 out of 1345 nucleotides 
(approximately 66% homology) . 

Fig. 5 is a comparison of a truncated synthetic 
crylA(b) gene described in PeriaJc et al., PNAS USA , 
38:3324-3328 (1991) [PMONBT] and a truncated synthetic maize 
optimized Bt crylA(b) gene [bssyn] . This figure shows that the 
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PMONBT gene sequence matches that of the truncated synth tic 
maize optimized 3t crylA(b) gen at about 1410 out of 1845 
nucleotides (approximately 77% homology) . 

Fig. 6 is a full-length, maize optimized Cry IB gene. 
Fig. 7 is a full-length, hybrid, partially maize 
optimized DNA sequence of a CryIA<b> gene which is contained in 
PCIB4434. The synthetic region is from nucleotides 1-1938 
(amino acids 1-646) , and the native region is from nucleotides 
1939-3468 (amino acids 647-11S5) . The fusion point between the 
synthetic and native coding sequences is indicated by a slash 
(/) in the sequence. 

Fig. 8 is a map of pCIB4434. 

Fig. 9 is a full-length, hybrid, maize optimized DNA 
sequence encoding a heat stable CrylA(b) protein, contained in 
pCX355il. 

Fig. 10 is a map of pCX3S51i. 

Fig. 11 is a full-length, hybrid, maize optimized DNA 
sequence encoding a heat stable CrylA(b) protein, contained in 
pCIB5512. 

Fig. 12 is a map of pCXB5512. 

Fig. 13 is a full-length, maize optimized DNA sequence 
encoding a heat stable CrylA(b) protein, contained in pCIB5513. 
Fig. 14 is a map of pCX355l3. 

Fig. 15 is a full-length, maize optimized DNA sequence 
encoding a heat-stable CrylA(b) gene, contained in pCIB5514. 
Fig. 16 is a map of pCIB5514. 
Fig. 17 is a map of pCIB4418. 

10 
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Fig. 18 is a map of pCIB4420. 
Fig. 19 is a map of pCIB4429. 
Fig. 20 is a map of pCI34421. 
Fig. 21 is a map of pCIB4428. 
Fig. 22 is a map of pCXS4430. 

Fig. 23A is a table containing data of crylA(b) protein 
levels in transgenic maize. 

Fig. 23B is a table which summarises results of 
bioassays of Ostrinia and Diatraea on leaf material from maize 
progeny containing a maize optimized Cry IA(b) gene. 

Fig. 23C is a table containing data of crylA(b) protein 
levels in transgenic maize. 

Fig. 23D is a table which summarizes the results of 
bioassays of Ostrinia and Diatraea on leaf material from maize 
progeny containing a synthetic 3t. maize gene operably linked 
to a pith promoter. 

Fig. 23E is a table containing data on expression of 
the crylA(b) gene in transgenic maize using the pith-preferred, 
promoter. 

Fig. 24 is a complete genomic DNA sequence of a maize 
tryptophan synthase-alpha subunit gene. Introns, exons, 
transcription and translation starts, start and stop of cDNA 
are shown. $ = start and end of cDNA; +1 =» transcription 
start; 73******* = primer extension primer; +1 = start of 
translation; +++ = stop codon; bp 1495-99 = CCAA? Box; bp 
1593-1598 = TATAA Box; bp 3720-3725 = poly A addition site; 
# above underlined sequences are PGR primers. 
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Figs. 25A, 25B, 25C and 2SD are Northern blot analyses 
which show differential expression of the maize TrpA subunit 
gene in maize tissue at 2 hour, 4 hour, 18 hour, and 48 hour 
intervals, respectively, at -80-C with DuPont Cronex 
intensifying screens. ?-?ith; Ocob; BR-brace roots; SS«ear 
shank; LP»lower pith; MP -middle pith; OP-upper pith; S-seed; 
L=leaf; R=root; and P (upper left) =t: oral pith- 
Fig. 26 is a Northern blot analysis, the two left lanes 
of which show the maize TrpA gene expression in the leaf (L) 
and pith (P) of Funk inbred lines 211D and 5N984. The five 
right lanes indicate the absence of expression in Funk 211D 
seed total RNA. S(l, 2,3, 4 and 5)= seed at 1, 2, 3, 4 and 5 
weeks post pollenation. L-leaf; P«pith; S#-seed * weeks post 
pollenation. 

Fig. 27 is a Southern blot analysis of genomic DNA Funk 
line 211D, probed with maize TrpA cONA 8-2 (pCIB5600) , wherein 
B denotes Bamfll, 3 denotes 3coRI, SV denotes BcoBV, H denotes 
HINDIII, and S denotes Sad. IX, 5X and 10X denote 
reconstructed gene copy equivalents. 

Fig. 28A is a primer extension analysis which shows the 
transcriptional start of the maize TrpA subunit gene and 
sequencing ladder. Lane +1 and +2 are IX + 0.5X samples of 
primer extension reaction. 

Fig. 28B is an analysis of RNase protection from +2 bp 
to +387 bp at annealing temperatures of 42-C, 48 -C and 54 -C, at 
a 16 hour exposure against film at -80-C with DuPont Cronex 
intensifying screens. 

12 
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Fig. 29 is A map of the original Type II 
pollen-specific cDNA clone. The subclaning of the three EcoRI 
fragments into pBluescript vectors to create pCIB3168, pCIB3169 
and II-. S is illustrated. 

Fig. 30 shows the DMA sequence of the maize 
pollen-specific calcium dependent protein kinase gene cDNA, as 
contained in the 1.0 kb and 0 . S 3cJb fragments of the original 
Type II cDNA clone. The ScoRI site that divides the 1.0 fcb and 
0.5 kb fragments is indicated. This cDNA is not full lengthy 
as the mRNA start site maps 490 bp upstream of the end of the 
cDNA clone. 

Fig. 31 illustrates the tissue-specific expression of 
the pollen CDPK mRNA. RNA from the indicated maize 21 ID 
tissues was denatured, electrophoresed on an agarose gel, 
transferred to nitrocellulose , and probed with the pollen CDPK 
cDNA 0.5 kb fragment. The mRNA is detectable only in the 
pollen, where a strong signal is seen. 

Fig. 32 is am amino acid sequence comparison of the 
pollen CDPK derived protein sequence and the rat protein 3cinase 
2 protein sequence disclosed in Tobimatsu et al., J. 3iol- 
Chem. 263:16082-16086 (1988) . The Align program of the DNAstar 
software package was used to evaluate the sequences. The 
homology to protein kinases occurs in the 5' two thirds of the 
gene, i.e. in the 1.0 3cb fragment. 

Fig. 33 is an amino acid sequence comparison of the 
pollen CDPK derived protein sequence and the human calmodulin 
protein sequence disclosed in Fischer et al., J. 3iol. Chem. 

13 
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253:17055-17062 (1988). The homology to calmodulin occurs in 
the 3' one third of the gene, i.e. in th 0.5 kh fragment. 

Fig. 34 is an amino acid sequence comparison of the 
pollen CDPK derived protein sequence and soybean CDPK. The 
homology occurs over the entire gene. 

Fig. 35 illustrates the sequence of the maize 
pollen-specific CDPK gene. 1.4 Kb of sequence prior to the 
mRNA start site is shown. The positions of the seven exons and 
six introns are depicted under the corresponding DNA sequence. 
The site of polyadenylation in the cDNA clone is indicated. 

Fig. 36 is a map of ?CIB4433. 

Fig. 37 is a fuli-iength f hybrid f maize-optimized DNA 
sequence encoding a heat stable crylA(b) protein. 

Fig. 38 is a map of pCIB5515. 

Description of the Sequences: 

Sequence 1 is the DNA sequence of a full-length native 
Bt crylA(b) gene. 

Sequence 2 is the DNA sequence of a full-length pure 
maize optimized synthetic 3t crylA(b) gene. 

Sequence 3 is the DNA sequence of an approximately 2 Kb 
truncated synthetic maize optimized 3t crylA(b) gene. 

Sequence 4 is the DNA sequence of a full-length 
synthetic maize optimized 3t crylA(b) gene. 

Sequence 5 is the DNA sequence of an approximately 2 Kb 
synthetic 3t gene according to Perlak et al. 

Detailed Description of the Invention 

The following definitions are provided in order to 

14 
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provide clarity with resp ct to the terms as they are used in 
the specification and claims to describe the present invention. 

Maize preferred codon: Preferred codon refers to the 
preference exhibited by a specific host ceil in the usage of 
nucleotide codons to specify a given amino acid. The preferred 
codon for an amino acid for a particular host is the single 
codon which most frequently encodes that amino acid in that 
host. The maize preferred codon for a particular amino acid 
may be derived from known gene sequences from maize. For 
example , maize codon usage for 28 genes from maize plants are 
listed in Table 4 of Murray et al., Nucleic Acids Research , 
17:477-498 (1989), the disclosure of which is incorporated 
herein by reference. For instance, the maize preferred codon 
for alanine is GCC, since, according to pooled sequences of 26 
maize genes in Murray et al., supra , that codon encodes alanine 
36% of the time, compared to GCG (24%), GCA (13%), and GCT 
(27%) . 

Pure maize optimized sequence: An optimized gene or DNA 
sequence refers to a gene in which the nucleotide sequence of a 
native gene has been modified in order to utilize preferred 
codons for maize. For example, a synthetic maize optimized Bt 
crylA(b) gene is one wherein the nucleotide sequence of the 
native Bt crylA(b) gene has been modified such that the codons 
used are the maize preferred codons, as described above. A 
pure maize optimized gene is one in which the nucleotide 
sequence comprises 100 percent of the maize preferred codon 
sequences for a particular polypeptide. For example, the pure 

15 



WO 93/07278 



PCT/US92/08476 



maize optimised Bt crylA(b) gene is one in which the nucleotide 
sequence conrpris s 100 percent maize preferred codon sequences 
and encodes a polypeptide with the same amino acid sequence as 
that produced by the native at crylA(b) gene. The pure 
nucleotide sequence of the optimized gene may be varied to 
permit manipulation of the gene, such as by altering a 
nucleotide to create or eliminate restriction sites. The pure 
nucleotide sequence of the optimized gene may also be varied to 
eliminate potentially deleterious processing sites , such as 
potential polyadenylation sites or intron recognition sites. 

It is recognized that "partially maize optimized, ■ 
sequences may also be utilized. By partially maize optimized, 
it is meant that the coding region of the gene is a chimeric 
(hybrid) f being comprised of sequences derived from a native 
insecticidal gene and sequences which have been optimized for 
expression in maize. A partially optimized gene expresses the 
insecticidal protein at a level sufficient to control insect 
pests, and such expression is at a higher level than achieved 
using native sequences only. Partially maize optimized 
sequences include those which contain at least about 5% 
optimized sequences* 

Full- length Bt Genes: Refers to DNA sequences 
comprising the full nucleotide sequence necessary to encode the 
polypeptide produced by a native 3t gene. For example, the 
native Bt cryIA{b) gene is approximately 3.5 Kb in length and 
encodes a polypeptide which is approximately 1150 amino acids 
in length. A full-length synthetic crylA(b) Bt gene would be 
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at least approximately 3.5 Kb in length. 

Truncated 3t Genes: Refers to DNA sequences comprising 
less than the full nucleotide sequence necessary to encode the 
polypeptide produced by a native 3t gene, but which encodes the 
active toxin portion of the polypeptide, ?or example , a 
truncated synthetic at gene of, approximately L9 Kb encodes the 
active toxin portion of the polypeptide such that the protein 
product exhibits insecticidal activity . 

Tissue-oref erred promoter: The term "tissue- 
preferred promoter - is used to indicate that a given regulatory 
DNA sequence will promote a higher level of transcription of an 
associated structural gene or DNA coding sequence , or of 
expression of the product of the associated- gene as indicated 
by any conventional RNA or protein assay, or that a given DNA 
sequence will demonstrate some differential effect; i.e., that 
the transcription of the associated DNA sequences or the 
expression of a gene product is greater in some tissue than in 
all other tissues of the plant. 

"Tissue-specific promoter* is used to indicate that a 
given regulatory DNA sequence will promote transcription of an 
associated coding DNA sequence essentially entirely in one or 
more tissues of a plant , or in one type of tissue , e.g. green 
tissue, while essentially no transcription of that associated 
coding DNA sequence will occur in all other tissues or types of 
tissues of the plant. 

The present invention provides DNA sequences optimized 
for expression in plants, especially in maize plants. In a 
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preferred embodiment of the present invention, the DNA 
sequences encode the production of an insecticidal toxin, 
preferably a polypeptide sharing substantially the amino acid 
sequence of an insecticidal crystal protein toxin normally 
produced by Bacillus thurincianaia . The synthetic gene may 
encode a truncated or full-length insecticidal protein. 
Especially preferred are synthetic DNA sequences which encode a 
polypeptide effective against insects of the order Lecidoctera 
and Coleoptera . and synthetic DNA sequences which encode a 
polypeptide having an amino acid sequence essentially the same 
as one of the crystal protein toxins of Bacillus thurinaiensis 
variety kurstaki, HD-1. 

The present invention provides synthetic DNA sequences 
effective to yield high expression of active insecticidal 
proteins in plants, preferably maize protoplasts, plant cells 
and plants. The synthetic DNA sequences of the present 
invention have been modified to resemble a maize gene in terms 
of codon usage and G+C content. As a result of these 
modifications, the synthetic DNA sequences of the present 
invention do not contain the potential processing sites which 
are present in the native gene. The resulting synthetic DNA 
sequences (synthetic 3t I? coding sequences) and plant 
transformation vectors containing this synthetic DNA sequence 
(synthetic 3t IP genes) result in surprisingly increased 
expression of the synthetic Bt IP gene, compared to the native 
Bt IP gene, in terms of insecticidal protein production in 
plants, particularly maize. The high level of expression 
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results in maize cells and plants that exhibit resistance to 
iepiciopteran insects , preferably European Corn Borer and 
Diatrea saccharalis , the Sugarcane 3orer. 

The synthetic DNA sequences of the present invention 
are designed to encode insecticidal proteins from 3acillus 
thurinciensis , but are optimized for expression in maize in 
terms of G+C content and codon usage. For example, the maize 
codon usage table described in Murray et al., supra , is used to 
reverse translate the amino acid sequence of the toxin produced 
by the 3acillus thurinciensis subsp. JcurstaJci HD-1 crylA(b) 
gene, using only the most preferred maize codons. The reverse 
translated DNA sequence is referred to as the pure maize 
optimized sequence and is shown as Sequence 4. This sequence 
is subsequently modified to eliminate unwanted restriction 
endonuciease sites f and to create desired restriction 
endonuciease sites. These modifications are designed to 
facilitate cloning of the gene without appreciably altering the 
codon usage or the maize optimized sequence. During the 
cloning procedure, in order to facilitate cloning of the gene, 
other modifications are made in a region that appears 
especially susceptible to errors induced during cloning by the 
polymerase chain reaction (PCR) . The final sequence of the 
maize optimized synthetic 3t IP gene is shown in Sequence 2. A 
comparision of the maize optimized synthetic 3t IP gene with 
the native JcurstaJci crylA(b) 3t gene is shown in Fig. l. 

In a preferred embodiment of the present invention, the 
protein produced by the synthetic DNA sequence is effective 
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against insects of the order LeoidoPtera or Coleootera . In a 
more preferred embodiment, the polypeptide encoded by the 
synthetic DNA sequence consists essentially of the full-length 
or a truncated amino acid sequence of an insecticidal protein 
normally produced by aacillus ftmrincriensis var. kurstaki HD-1 . 
In a particular embodiment, the synthetic DNA sequence encodes 
a polypeptide consisting essentially of a truncated amino acid 
sequence of the 3t CrylA(b) protein. 

The insecticidal proteins of the invention are 
expressed in a plant in an amount sufficient to control insect 
pests, i.e. insect controlling amounts. It is recognized that 
the amount of expression of insecticidal protein in a plant 
necessary to control insects may vary depending upon species of 
plant, type of insect, environmental factors and the like. 
Generally, the insect population will be kept below the 
economic threshold which varies from plant to plant. For 
example, to control European corn borer in maize, the economic 
threshold is .5 eggmass/plant which translates to about 10 
larvae/plant . 

The methods of the invention are useful for controlling 
a wide variety of insects including but not limited to 
rootworms, cutworms, armyworms, particularly fall and beet 
armyworms, wireworms, aphids, corn borers, particularly 
Suropean corn borers, sugarcane borer, lesser corn stalk borer, 
Southwestern com borer, etc. 

In a preferred embodiment of the present invention, the 
synthetic coding DNA sequence optimized for expression in 
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maize comprises a G+C percentage greater than tiiat of the 
native crylA(b) gene. It is preferred that the G+C percentage 
be at least about 50 percent, and more preferably at least 
about 60 percent. It is especially preferred that the G+C 
percent be about 64 percent. 

In another preferred embodiment of the present 
invention, the synthetic coding DNA sequence optimized for 
expression in maize comprises a nucleotide sequence having at 
least about 90 percent homology with the "pure" maize optimized 
nucleotide sequence of the native 3acillus thurinaiensis 
crylA(b) protein, more preferably at least about 95 percent 
homology, and most preferably at least about 98 percent. 

Other preferred embodiments of the present invention 
include synthetic DNA sequences having essentially the DNA 
sequence of Sequence ID No. 4, as well as mutants or variants ' 
thereof; transformation vectors comprising essentially the DNA 
sequence of Sequence ID No. 4; and isolated DNA sequences 
derived from the plasmids ?CI34406, pCI34407, pCIB4413, 
pCI34414, pCIB4416 f pCI34417, pCIB4418, ?CIB4419, pCIB4420, 
?CIB4421, pCIB4423 f pCI34434, pCI34429, ?CIB4431 f pCIB4433. 
Most preferred are isolated DNA sequences derived from the 
plasmids pCIB4418 and pCIB4420, pCIB4434, ?CIB4429, pCI34431, 
and pCIB4433. 

In order to construct one of the maize optimized DNA 
sequences of the present invention, synthetic DNA 
oligonucleotides are made with an average length of about 30 
nucleotides. These oligonucleotides are designed to hybridize 
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to produce fragments comprising the various quarters of rue 
truncated toxin gene. The oligonucleotides for a given quarter 
are hybridized and amplified using ?CR. The quarters are then 
cloned and the cloned quarters are sequenced to find those 
containing the desired sequences. In one instance, the fourth 
quarter, the hybridized oligonucleotides are cloned directly 
without PCS. amplification. Once all clones of four quarters 
are found which contain open reading frames, an intact gene 
encoding the active insecticidal protein is assembled. The 
assembled gene may then be tested for insecticidal activity 
against any insect of interest including the European Corn 
Borer (ECS) and the sugarcane borer. (Examples 5A and 5B, 
respectively) . When a fully functional gene is obtained, it is 
again sequenced to confirm its primary structure. The fully 
functional gene is found to give 100% mortality when bioassayed 
against ECS- The fully functional gene is also modified for 
expression in maize. 

The maize optimized gene is tested in a transient 
expression assay , e.g. a maize transient expression assay. 
The native Bt crylA(b) coding sequence for the active 
insecticidal toxin is not expressed at a detectable level in a 
maize transient expression system. Thus, the level of 
expression of the synthesized gene can be determined. 3y the 
present methods, expression of a protein in a transformed plant 
can be increased at least about 100 fold to about 50,000 fold, 
more specifically at least about 1,000 fold to at least about 
20,000 fold. 
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Increasing expression of an insecticial gene to an 
effective level does not require manipulation of a native gene 
along the entire sequence. Effective expression can be 
achieved by manipulating only a portion of the sequences 
necessary to obtain increased expression. A full -length, maize 
optimized CrylA(b) gene may be prepared which contains a 
protein of the native CrylA(b) sequence. For example, Figure 7 
illustrates a full-length, maize optimized CrylA(b) gene which 
is a synthetic-native hybrid. That is, about 23cb of the gene 
(nucleotides 1-1938) is maize optimized, i.e. synthetic. The 
remainder, C-terminal nucleotides 547-1155, are identical to 
the corresponding sequence native of the CrylA(b) gene. 
Construction of the illustrated gene is described in Example 6, 
below . 

It is recognized that by using the methods described 
herein, a variety of synthetic /native hybrids may be 
constructed and tested for expression. The important aspect of 
hybrid construction is that the protein is produced in 
sufficient amounts to control insect pests. In this manner, 
critical regions of the gene may be identified and such regions 
synthesized using preferred codons. The synthetic sequences 
can be linked with native sequences as demonstrated in the 
Examples below. Generally, N-terminal portions or processing 
sites can be synthesized and substituted in the native coding 
sequence for enhanced expression in plants. 

In another embodiment of the present invention, the 
maize optimized genes encoding crylA(b) protein may be 
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manipulated to render the encoded protein more heat stable or 
temperature stable compared to the native crylA(b) protein. It 
has been shown that the crylA(b) gene found in Bacillus 
fhurinoiensis fcurstaxi HD-I contains a 26 amino acid deletion, 
when compared with the crylA(a) and crylA(c) proteins, in the 
-C0OH half of the protein. This deletion leads to a 
temperature-sensitive crylA(b) protein. See M. Geiser, ES 0 
440 581, entitled "Temperaturstabiles Bacillus 
thurinoiensis -Toxin" . Repair of this deletion with the 
corresponding region from the crylA(a) or crylA(c) protein 
improves the temperature stability of the repaired protein. 
Constructs of the full-length modified crylA(b) synthetic gene 
are designed to insert sequences coding for the missing amino 
acids at the appropriate place in the sequence without altering 
the reading frame and without changing the rest of the protein 
sequence. The full-length synthetic version of the gene is 
assembled by synthesizing a series of double-stranded DNA 
cassettes, each approximately 300 bp in size, using standard 
techniques of DNA synthesis and enzymatic reactions. The 
repaired gene is said to encode a "heat stable" or 
"temperature-stable" crylA(b) protein, since it retains more 
biological activity than its native counterpart when exposed to 
high temperatures. Specific sequences of maize optimized, heat 
stable crylA(b) genes encoding temperature stable proteins are 
set forth in Figs. 9, 11, 13, and IS, and are also described in 
Example 7, below. 

The present invention encompasses maize optimized 
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coding sequences encoding other polypeptides, including those 
of other Bacillus thurinaiensis insecticidal polypeptides or 
insecticidal proteins from other sources. For example, cry IB 
genes can be maize optimized, and then stably introduced into 
plants, particularly maize. The sequence of a maize optimized 
crylB gene constructed in accordance with the present invention 
is set forth in Fig. 6. 

Optimizing a Bt IP gene for expression in maize using 
the maize preferred codon usage according to the present 
invention results in a significant increase in the expression 
of the insecticidal gene. It is anticipated that other genes 
can be synthesized using plant codon preferences to improve 
their expression in maize or other plants. Use of maize codon 
preference is a likely method of optimizing and maximizing 
expression of foreign genes in maize. Such genes include genes 
used as selectable or scoreable markers in maize 
transformation, genes which confer herbicide resistance, genes 
which confer disease resistance, and other genes which confer 
insect resistance. 

The synthetic crylA(b) gene is also inserted into 
Acrrobactsrium vectors which are useful for transformation of a 
large variety of dicotyledonous plant species. (Example 44) . 
Plants stably transformed with the synthetic crylA(b) 
Acrrobacterium vectors exhibit insecticidal activity. 

The native Bt cry LA (b) gene is quite A+T rich. The G+C 
content of the full-length native Bt cry LA (b) gene is 
approximately 39%. The G+C content of a truncated native Bt 

25 



WO 93/07278 



PCT/US92/08476 



crylA(b) gene of about 2 Xb in length is approximately 37%. In 
general, maize coding regions tend to be predominantly G+C 
rich. The modifications made to the at crylA(b) gene result in 
a synthetic IP coding region which has greater than 50% G+C 
content, and has about 65% homology at the DNA level with the 
native crylA(b) gene. The protein encoded by this synthetic 
Cryia.(b) gene is 100% homologous with the native protein, and 
thus retains full function in terms of insect activity. The 
truncated synthetic CrylA(b) IP gene is about 2 Kb in length 
and the gene encodes the active toxin region of the native Bt 
fcarstaJci CrylA(b) insecticidal protein. The length of the 
protein encoded by the truncated synthetic Cry IA(b) gene is 648 
amino acids. 

The synthetic genes of the present invention are useful 
for enhanced expression in transgenic plants, most preferably 
in transformed maize. The transgenic plants of the present 
invention may be used to express the insecticidal CrylA(b) 
protein at a high level, resulting in resistance to insect 
pests, preferably coleopteran or iepidoptaran insects, and most 
preferably European Corn 3orer (ECS) and Sugarcane 3orer. 

In the present invention, the DNA coding sequence of 
the synthetic maize optimized gene may be under the control of 
regulatory elements such as promoters which direct expression 
of the coding sequence. Such regulatory elements, for example, 
include monocot or maize and other monocot functional promoters 
to provide expression of the gene in various parts of the maize 
plant. The regulatory element may be constitutive. That is, 
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it may promote continuous and stable expression of the gene. 
Such promoters include but are not limited to the CaMV 35S 
promoter; the CaMV 19S promoter; tumef aciens promoters such 
as oct opine synthase promoters, mannopine synthase promoters , 
nopaline synthase promoters, or other opine synthase promoters; 
ubiquitin promoters , act in promoters, hist one promoters and 
tubulin promoters. The regulatory element may be a 
tissue-preferential promoter, that is, it may promote higher 
expression in some tissues of a plant than in others. 
Preferably, the tissue-preferential promoter may direct higher 
expression of the synthetic gene in leaves, stems , roots and/ or 
pollen than in seed. The regulatory element may also be 
inducible, such as by heat stress, water stress , insect feeding 
or chemical induction, or may be developmentally regulated. 
Numerous promoters whose expression are known to vary in a 
tissue specific manner are known in the art. One such example 
is the maize phosphoenol pyruvate carboxylase (PEPC) , which is 
green tissue-specific. See , for example , Hudspeth, R.L. and 
Grula, J.W., Plant Molecular Biology 12:579-589, 1989). Other 
green tissue-specific promoters include chlorophyll a/b binding 
protein promoters and RubisCO small subunit promoters. 

The present invention also provides isolated and 
purified pith-preferred promoters. Preferred pith^-pref erred 
promoters are isolated from graminaceous monocots such as 
sugarcane, rice, wheat, sorghum, barley, rye and maize; more 
preferred are those isolated from maize plants. 

In a preferred embodiment., the pith-preferred promoter 
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is isolated from a plant TrpA gene; in a most preferred 
embodiment, it is isolated from a maize TrpA gene. That is, 
the promoter in its native state is operativeiy associated with 
a maize tryptophan synthase- alpha subunit gene (hereinafter 
"TrpA") - The encoded protein has a molecular mass of about 
38kD. Together with another alpha subnit and two beta 
subunits, TrpA forms a multimeric enzyme, tryptophan synthase. 
Each subunit can operate separately, but they function more 
efficiently together. TrpA catalyzes the conversion of indole 
glycerol phosphate to indole. Neither the maize TrpA gene nor 
the encoded protein had been isolated from any plant before 
Applicants' invention. The Arabidopsis thaliana tryptophan 
synthase beta subunit gene has been cloned as described Wright 
et al., The Plant Cell , 4:711-719 (1992). The instant maize 
TrpA gene has no homology to the beta subunit encoding gene. 

The present invention also provides purified 
pollen-specific promoters obtai n able from a plant 
calcium-dependent phosphate kinase (CDPK) gene. That is, in 
its native state, the promoter is operably linked to a plant 
CDPK gene. In a preferred embodiment, the promoter is isolated 
from a maize CDPK gene. 3y "pollen-specific," it is meant that 
the expression of an operativeiy associated structural gene of 
interest is substantially exclusively (i.e. essentially 
entirely) in the pollen of a plant, and is negligible in all 
other plant parts. By "CDPK," it is meant a plant protein 
kinase which has a high affinity for calcium, but not 
calmodulin, and requires calcium, but not calmodulin, for its 
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catalytic activity. 

To obtain tissue-preferred or tissue specific 
promoters , genes encoding tissue specific messenger RNA (mRNA) 
can be obtained by differential screening of a cDNA library. 
For example, a pith-preferred cDNA can be obtained by 
subjecting a pith cDNA library to differential screening using 
cDNA probes obtained from pitii and seed mRNA. See, Molecular 
Cloning, A Laboratory Manual, Sambrook et al. eds. Cold Spring 
Harbor Press: New York (1989) . 

Alternately, tissue specific promoters may be obtained 
by obtaining tissue specific proteins , sequencing the 
N- terminus, synthesizing oligonucleotide probes and using the 
probes to screen a cDNA library. Such procedures are 
exemplified in the Experimental section for the isolation of a 
pollen specific promoter. 

The scope of the present invention in regard to the 
pith-preferred and pollen-specific promoters encompasses 
functionally active fragments of a full-length promoter that 
also are able to direct pith-preferred or pollen-specific 
transcription, respectively, of associated structural genes. 
Functionally active fragments of a promoter DNA sequence may be 
derived from a promoter DNA sequence, by several art -recognized 
procedures, such as, for example, by cleaving the promoter DNA 
sequence using restriction enzymes, synthesizing in accordance 
with the sequence of the promoter DNA sequence, or may be 
obtained through the use of PCR technology. See , e.g. Mullis 
et al., Meth. En2vmol. 155:335-350 (1987); Srlich <ed.), PCR 
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Technology, Stockton Press (New York 1989) . 

Further included within the scope of the instant 
invention are pith-preferred and pollen-specific promoters 
"equivalent- to the full-length promoters. That is, different 
nucleotides, or groups of nucleotides may be modified, added or 
deleted in a manner that does not abolish promoter activity in 
accordance with known procedures. 

A pith-preferred promoter obtained from a maize TrpA 
gene is shown in Fig. 24. Those skilled in the art, with this 
sequence information in hand, will recognize that 
pith-preferred promoters included within the scope of the 
present invention can be obtained from other plants by probing 
pith libraries from these plants with probes derived from the 
maize TrpA structural gene. Probes designed from sequences 
that are highly conserved among TrpA subunit genes of various 
species, as discussed generally in Sxample 17, are preferred. 
Other pollen-specific promoters, which in their native state 
are linked' to plant CDPK genes other than maize, can be 
isolated in similar fashion using probes derived from the 
conserved regions of the maize CDPK gene to probe pollen 
libraries . 

In another embodiment of the present invention, the 
pith-preferred or pollen-specific promoter is operably linked 
to a DNA sequence, i.e. structural gene, encoding a protein of 
interest, to form a recombinant DNA molecule or chimeric gene. 
The phrase "operably linked to" has an art-recognized meaning; 
it may be used interchangeably with "operatively associated 
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with, nm linked to," or "fused to". 

The structural gene may b homologous or heterologous 
with respect to origin of the promoter and/or a target plant 
into which it is transformed. Regardless of relative origin, 
the associated DNA sequence will be expressed in the 
transformed plant in accordance with the expression properties 
of the promoter to which it is liniced. Thus, the choice of 
associated DNA sequence should flow from a desire to have the 
sequence expressed in this fashion. Examples of heterologous 
DNA sequences include those which encode insect icidal proteins, 
e.g. proteins or polypeptides toxic or inhibitory to insects or 
other plant parasitic arthropods, or plant pathogens such as 
fungi, bacteria and nematodes. These heterologous DNA 
sequences encode proteins such as magainins, Zasloff , PNA5 USA. 
34:5449-5453 (1987); cecropins, Hultmark et al., Sur. J. 
Biochem. 127:207-217 (1982); attacins, Hultmark et al., SMBQ j. 
2:571-576 (1983); meiittin, gramicidin S, Katsu et al., 
3iochem. Biophvs. Acta , 939:57-53 (1988); sodium channel 
proteins and synthetic fragments, Oiki et al. PNASJJSA, 
85:2395-2397 (1988); the alpha toxin of Staphylococcus aureusm 
Tobkes et al., Biochenu, 24:1915-1920 (1985); apolipoproteins 
and fragments thereof, Knott et al., Science 230:37 (1985); 
Nakagawa et al., J. Am. Chem. Soc . 107:7087 (1985); 
alamethicin and a variety of synthetic amphipathic peptides, 
Kaiser et al., Ann. Rev. 3ioohvs. 3iophvs: Chem. 16:561-581 
(1987); lectins, Lis et al., Ann. Rev. Biochem. , 55:35-68 
(1986); protease and amylase inhibitors; and insecticidai 
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proteins from Bacillus thurincriensis , particularly the 
delta-endotoxins from 3^ thurincriensis ; and from other bacteria 
or fungi. 

In a preferred embodiment of the invention , a 
pith-preferred promoter obtained from a maize TrpA subunit gene 
or pollen-specific promoter obtained from a maize CDPK gene is 
operably linked to a heterologous DNA sequence encoding a 
Bacillus thurincriensis ("S.t.") insecticidal protein* These 
proteins and the corresponding structural genes are well known 
in the art. See , Hofte and Whiteley, Microbio l. Reviews, 
53:242-255 (1989) . 

While it is recognized that any promoter capable of 
directing expression can be utilized, it may be preferable to 
use heterologous promoters rather than the native promoter of 
the protein of interest. In this manner, chimeric nucleotide 
sequences can be constructed which can be determined based on 
the plant to be transformed as well as the insect pest. For 
example, to control insect pests in maize, a monocot or maize 
promoter can be operably linked to a Bt protein. The maize 
promoter can be selected from tissue-preferred and tissue- 
specific promoters such as pith-preferred and pollen-specific 
promoters, respectively as disclosed herein. 

In some instances, it may be preferred to transform the 
plant cell with more than one chimeric gene construct. Thus, 
for example, a single plant could be transformed with a 
pith-preferred promoter operably linked to a Bt protein as well 
as a pollen-specific promoter operably linked to a 3t protein. 
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The transformed plants would express 3t proteins in tiie plant: 
pith and pollen and to a lesser extent the roots r outer sheath 
and brace roots. 

For various other reasons , particularly management of 
potential insect resistance developing to plant expressed 
insecticidal proteins , it is beneficial to express more than 
one insecticidal protein (IP) in the same plant. One could 
express two different genes (such as two different 3acillus 
thurinaiensis derived delta-endotoxins which bind different 
receptors in the target insect's midgut) in the same tissues , 
or one can selectively express the two toxins in different 
tissues of the same plant using tissue specific promoters. 
Expressing two 3t genes (or any two insecticidal genes) in the 
same plant using three different tissue specific promoters 
presents a problem for production of a plant expressing the 
desired phenotype. Three different promoters driving two 
different genes yieids six different insecticidal genes that 
need to be introduced into the plant at the same time. Also 
needed for the transformation is a selectable marker to aid in 
identification of transformed plants. This means introducing 
seven different genes into the plant at the same time. It is 
most desired that all genes, especially the insecticidal genes, 
integrate into the plant genome at the same locus so they will 
behave as a single gene trait and not as a multiple gene trait 
that will be harder to tracJc during breeding of commercial 
hybrids. The total number of genes can be reduced by using 
differential tissue specific expression of the different 
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insecticidal proteins. 

For example, by fusing crylA(b) with die pollen and PEE 
carboxylase promoters, one would obtain expression of this gene 
in green . tissues and pollen. Fusing a pith-preferred promoter 
with the cry IB delta endotoxin from Bacillus thurinaiensis 
would produce expression of this insecticidal protein most 
abundantly in the pith of a transformed plant, but not in seed 
tissues. Transformation of a plant with three genes, PEP 
carboxylase/cryIA(b) , pollen/ cry IA(b) , and pith/crylB produces 
a plant expressing two different 3t insecticidal endotoxins in 
different tissues of the same plant. CrylA(b) would be 
expressed in the ■ outside" tissues of a plant (particularly 
maize) , that is, in those tissues which European corn borer 
feeds on first after hatching. Should SC3 prove resistant to 
crylA(b) and be able to burrow into the stalk of the plant 
after feeding on leaf tissue and/ or pollen, it would then 
encounter the cry IB delta-endotoxin and be exposed to a second 
insecticidal component. In this manner, one can differentially 
express two different insecticdal components in the same plant 
and decrease the total number of genes necessary to introduce 
as a single genetic unit while at the same time providing 
protection against development of resistance to a single 
insecticidal component. 

Likewise, a plant may be transformed with constructs 
encoding more than one type of insecticidal protein to control 
various insects . Thus, a number of variations may be 
constructed by one of skill in the art. 
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The recombinant DNA molecules of the invention may be 
prepared by manipulating the various elements to place them in 
proper orientation. Thus, adapters or linkers may be employed 
to join the DNA fragments. Other manipulations may be 
performed to provide for convenient restriction sites, removal 
of restriction sites or superfluous DNA. These manipulations 
can be performed by art-recognized methods. See , Sambrook et 
al " Molecul ar Cloning: A Laboratory Manual . Cold Spring 
Harbor Laboratory Press, second edition, 1989. For example, 
methods such as restriction, chewing back or filling in 
overhangs to provide blunt ends, ligation of linkers, 
complementary ends of the DNA fragments can be provided for 
joining and ligation. See , Sambrook et al., supra. 

Other functional DNA sequences may be included in the 
recombinant DNA molecule, depending upon the way in which the 
molecule is to be incorporated into the target plant genome. 
For instance, in the case of Aarobacterium- mediated 
transformation, if Ti- or the Hi- plasmid is used to transform 
the plant cells, the right and left borders of the T-DNA of the 
Ti- and Ri- plasmid will be joined as flanking regions to the 
expression cassette. Aoxobactgrium tumefaciens-mediated 
transformation of plants has been described in Horsch et al . , 
Science. 225:1229 (1985); Marton, Call Culture Somatic Cell 
Genetics of Plants. 1:514-521 (1984); Hoekema, In: The Binary 
Plant Vec tor Svstem Of fset-Drukkeri j Ranters 3.V., 
Alblasserdam, 1985, Chapter V Fraley, et al., Crit. Rev. Plant 
Sci., 4:1-46; and An et al., SMBO J. , 4:277-284 (1985). 
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The recombinant DNA molecules of the invention also can 
include a marker gene to facilitate selection in recombinant 
plant cells. Examples of markers include resistance to a 
biocide such as an -antibiotic, e.g. kanamycin, hygroxaycin, 
chloramphenicol, paramomycin, methotrexate and bleomycin, or a 
herbicide such as imidazolones, sulfonylureas, glyphosate, 
phosphinothricin, or bialaphos. Marker getnes are well known in 
the art. 

In another embodiment of the present invention, 
plants stably transformed with a recombinant DNA molecule or 
chimeric gene as described hereinabove are provided. The 
resultant transgenic plant contains the transformed gene stably 
incorporated into its genome, and will express the structural 
gene operably associated to the promoter in the respective 
fashion. 

Transgenic plants encompassed by the instant invention 
include both monocots and dicots. Representative examples 
include maize, tobacco, tomato, cotton, rape seed, soybean, 
wheat, rice, alfalfa, potato and' sunflower . [others?]. 
Preferred plants include maize, particularly inbred maize 
plants . 

All transformed plants encompassed by the instant 
invention may be prepared by several methods known in the art. 
A. tumefaciens -mediated transformation has been disclosed 
above. Other methods include direct gene transfer into 
protoplasts, Paszkowski et al., SMBO J. , 12:2717 (1984); Loerz 
et al., Mol. Gen. & Genet, , 1199:17 rt (1985); Fromm at al., 
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Hauire 319:719 (1986); raicropro jectiie bombardment, Klein et 
al., Bio /Technology. 6:559-563 (1988); injection into 
protoplasts, cultured cells and tissues, Reich et al., 
Bio/Technology, 4:1001-1004 (1986); or injection into 
raeristematic tissues or seedlings and plants as described by De 
La Pena et al., Nature . 325:274-276 (1987); Graves et al., 
Plant Mol. 31ol., 7:43-50 (1986); Hooyicaas-Van Slogteren et 
al., Nature, 311:763-764 (1984); Grimsley et al., 
Bio /Technology, S:185 (1988); and Grimsley et al., Nature, 
325:177 (1988); and electroporation, WO92/09696. 

The expression pattern of a structural gene operatively 
associated with an instant tissue-preferred or tissue-specific 
promoter in a transformed plant containing the same is critical 
in the case where the structural gene encodes an insecticidal 
protein. For example, the instantly disclosed pith-preferred 
expression pattern will allow the transgenic plant to tolerate 
and withstand pathogens and herbivores that attack primarily 
the pith, but also the brace roots, outer sheath and leaves of 
the plant since the protein will be expressed to a lesser 
extent but still in an insect controlling amount in these plant 
parts, but yet in the case of both types of promoters, will 
leave the seed of the plant unaffected. 

SXAMPLSS 

The following examples further describe the materials 
and methods used in carrying out the invention. They are 
offered by way of illustration, and not by way of limitation. 
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5XAMPLS 1: General Methods 

DNA manipulations were done using procedures that are 
standard in the art. These procedures can often be modified 
and/or substituted without substantively changing the result. 
Except where other references are identified, most of these 
procedures are described in Sambrook et al., Molecula r Cloning: 
A Laboratory Manual , Cold Spring Harbor Laboratory Press , 
second edition , 1989. 

Synthesis of DNA oligomers: 

DNA oligomers which are from about twenty to about 
ninety, preferably from about sixty to about eighty nucleotides 
in length, are synthesized using an Applied Biosystems 
model 3 8 OB DNA synthesiser and standard procedures. The 
oligomers are made using the updated SSCAF3 cycle on a 0.2 
jimole, wide pore, small scale ABI column. The end procedure is 
run trityl off and the oligomer is cleaved from the column 
using the 380B's automatic cleavage cycle. The oligomers are 
then deblocked in excess ammonium hydroxide (NH^OH) at 55 - C for 
8-12 hours. The oligomers are then dried in an evaporator 
using nitrogen gas. After completion, the oligomers are 
resuspended in 0.25 - 0.5 ml of deionized water. 

Purification of synthetic oligomers: 

An aliquot of each oligomer is mixed with an equal 
volume of blue dye\f ormamide mix with the final solution 
containing 0.05% bromophenol blue, 0.05% . xylene cyanol FF, and 
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25 % formamide. This mixture is heated at 95 # C for 10 minutes 
to denature the oligomers. Samples are then applied to a 12 % 
polyacrylamide-urea gel containing 7 M urea (Sambrook et al.) . 
After electrophoresis at 300-400 volts for 3-4 hours using a 
Vertical Slab Gel Unit (Hoefer Scientific Instruments, San 
Francisco, CA) , UV shadowing is used to locate the correct 
sized fragment in the gel which was then excised using a razor 
blade. The purified gel fragment is minced and incubated in 
0.4 M LiCl, 1 mM EDTA (pfl 8) buffer overnight at 37 -C. 

Either of two methods is used to separate the oligomers 
from the polyacrylamide gel remnants: Gene\X 25 uM porous 
polyethylene filter units or Millipore's ultrafree-MC 0.45 uM 
filter units. The purified oligomers are ethanol precipitated, 
recovered by centrifuging in a microfuge for 20 min at 4»C, and 
finally resuspended in TE {10 mM Tris, 1 mM 3DTA, pH 8.0). 
Concentrations are adjusted to 50 ng\ul based on absorption 
readings at 260 nm. 



Kinasing oligomers for size determinations: 

To check the size of 3ome of the oligomers on a 
sequencing gel, kinase labeling reactions are carried out using 
purified synthetic oligomers of each representative size: 
40mers, 60mers, 70mers, 80mers, and 90mers. In each 20 ul 
kinasing reaction, one pmole of purified oligomer is used in a 
buffer of 7.0 mM Tris pH 7.5, 10 mM KC1, i mM MgC12) , 0.5 mM 
DTT, 50 ug/mi BSA. 3000 uCi (3 pmoles) of 32P-gammaATP, and 
8 units of T4 polynucleotide kinase. The kinase reaction is 
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incubated for 1 hour at 37»C, followed by a phenol \ chloroform 
extraction and three ethanol precipitations with glycogen as 
carrier (Tracy, Prep. 3iochem. 11:251-268 (1981) . 

Two gel loadings (one containing 1000 cpm, the other 
containing 2000 cpm) of each reaction are prepared with 25% 
formamide, 0.05% bromophenol blue, and 0.05% xylene cyanol FF. 
The kinased oligomers are boiled for 5 minutes before loading 
on a 6 % polyacrylamide, 7 M urea sequencing gel (BRL Gel 
Mix TM6, BRL, Gaithersburg, MD) . A sequencing reaction of 
plasmid pOC18 is run on the same gel to provide sire markers. 

After electrophoresis, the gel is dried and exposed to 
diagnostic X-ray film (Kodak, X-OMAT AR) . The resulting 
autoradiograph shows all purified oligomers tested to be of the 
correct size. Oligomers which had not been sized directly on 
the sequencing gel are run on a S % polyacrylamide, 7 M urea 
gel (BRL Gel Mix TM6) , using the sized oligomers as size 
markers. All oligomers are denatured first with 25 % formamide 
at 100 *C for 5 minutes before loading on the gel. Sthidium 
bromide staining of the polyacrylamide gel allows all the 
oligomers to be visualized for size determination. 

Hybridizing oligomers for direct cloning: 

Oligomers to be hybridized are pooled together (from 1 
ug to 20 ug total DNA) and kinased at 37 •Q for 1 hour. in IX 
Promega ligation buffer containing 30 mM Tris-HCl pH 7.8, 10 mM 
MgC12, 10 mM DTT, and 1 mM dATP . One to 20 units of T4 
polynucleotide kinase is used in the reaction, depending on the 
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amount of total DNA present . The kinasing reactions are 
stopped by placing the reaction in a boiling water batii £ot 
five minutes. Oligomers to form the 5' termini of the 
hybridized molecules are not Icinased but are added to the 
Icinased oligomers along with additional hybridization buffer 
after heating. The pooled oligomers are in a volume of 50-100 
ul with added hybridization buffer used to adjust the final 
salt conditions to 100 mM NaCl, 120 mM Tris pH 7.5, and 10 mM 
MgC12. The Icinased and non-icinased oligomers are pooled 
together and heated in a boiling water bath for five minutes 
and allowed to slowly cool to room temperature over a period of 
about four hours. The hybridized oligomers are then 
phenol\chioroform extracted, ethanol precipitated, and 
resuspended in 17 ul of TE (10 mM Tris, 1 mM EDTA, pH 8.0) . 
Using this 17 ul, a ligation reaction with a final volume of 20 
ul is assembled (final conditions = 30 mM Tris-HCl pH 7.8, 10 
mM MgC12, 10 mM DTT, 1 mM ATP, and 3 units of T4 DNA ligase 
(Promega, Madison WI) . The ligation is allowed to incubate for 
about 2 hours at room temperature. The hybridizedMigated 
fragments are generally purified on 2% Nusieve gels before 
and\or after cutting with restriction enzymes prior to cloning 
into vectors. A 20 ul volume ligation reaction is assembled 
using 100 ng to 500 ng of each fragment with approximate 
equimoiar amounts of DNA in 30 mM Tris-HCl pH 7.8, 10 mM MgC12, 
10 mM DTT, 1 mM ATP, and 3 units of T4 DNA ligase (Promega, 
Madison, WI) . Ligations are incubated at room temperature for 
2 hours. After ligation, DNA is transformed into frozen 
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competent coli cells using standard procedures (SambrooJc et 
al.) and transf ormants are selected on L3-agar (SambrooJc et 
al.) containing 100 ug/ml ampicillin (see below). 

PCR Reactions for Screening clones in 2^ coli: 

5. coli colonies which contain the correct DNA insert 
are identified using ?CR ( see generally, Sandhu et al., 
3ioTechniaues 7:689-690 (1989)). tJsing a toothpick, colonies 
are scraped from an overnight plate and added to a 20 ul to 45 
wl PCR reaction mix containing about 50 pmoles of each 
hybridizing primer (see example using primers MK23A28 and 
MK25A28 to select orientation of SacII fragment in pHY32#6) , 
200 pm to 400 mM of each dNTP, and IX reaction buffer (Perkin 
Slmer Catus, Norwalk, CT) . After boiling the 3^ coli \ PCR mix 
in a boiling water bath for 10 minutes, 5 ul of ?aq polymerase 
(0.5 units) (PerJcin Slmer Catus, Norwalk, Conn.) in IX reaction 
buffer is added. The PCR reaction parameters are generally set 
with a denaturing step of 94 9 C for 30 seconds f annealing at 
55 *C for 45 seconds, and extension at 72*C for 45 seconds for 
30 to 36 cycles. PCR reaction products are run on agarose or 
Nusieve agarose (FMC) gels to detect the correct fragment size 
amplified. 

Ligations : 

Restriction enzyme digested fragments are either 
purified in 1% LGT (low gelling temperature agarose, FMC) , 2% 
Nusieve (FMC) , or 0.75% agarose using techniques standard in 
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the art. DNA bands are visualized with ethidium bromide and 
bands are recovered from gels by excision with a razor blade. 
Fragments isolated from LGT are ligated directly in the LGT. 
Ten microliters of each recovered DNA. fragment is used to 
assemble the ligation reactions, producing final ligation 
reaction volumes of about 23 u l. After excision with a razor 
blade, the recovered gel bands containing the desired DNA 
fragments are melted and brought to IX ligase buffer and 3 
units of T4 DNA ligase (Promega) are added as described above. 
Fragments isolated from either regular agarose or Nusieve 
agarose are purified from the agarose using ultrafree-MC 0.45 
uM filter units (Millipore) and the fragments are ligated as 
described above. Ligation reactions are incubated at room 
temperature for two hours before transforming into frozen 
competent ^ coli cells using standard procedures (SambrooJc «t 



al.) 



Transformations : 

Frozen competent coli cells of the strain DH5 alpha 
or BB101 are prepared and transformed using standard procedures 
(Sambroo* et al., . ^ Coli "SURE" competent cells are obtained 
from Stratagene (La Jolla, CA, . For ligations carried out in 
LGT agarose, after ligation reactions are complete, 50 mM CaC12 
is added to a final volume of about ISO ul and the solution 
heated at approximately S5-C for about 10 minutes to completely 
melt the agarose. The solution is then mixed and chilled on 
ice for about 10 minutes before the addition of about 200 ul of 
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competent cells which had been thawed on ice. This mixture is 
allowed to incubate for 30 minutes on ice. The mixture is next 
heat shocked at 42 # C for 60 seconds before chilling on ice for 
two minutes. Next, 800 ul of SOC media (20% tryptone, 0.5% 
yeast extract, 10 mM NaCl, 2.5 mM XC1, adjusted to ?H 8 with 5 
N NaOHr 20 mM MgC12:MgS04 mix r and 20 mM glucose; Sambrook et 
al-) is added and the cells are incubated at 37 # C with shaking 
for about one hour before plating on selective media plates. 
Plates typically are L-agar (Sambrook et al.) containing 100 
ug/ml ampicillin. 

When ligations are carried out in a solution without 
agarose, typically 200 ul of frozen competent coli cells 
(strain DH5 alpha (SRL, Gaithersburg, MD or Sure cells, 
Stratagene, La Jolla, CA) are thawed on ice and 5 ul of the 
ligation mixture added. The reaction is incubated on ice for 
about 45 to SO minutes, the cells are then heat shocked at 42 • 
for about 90 seconds. After recovery at room temperature for 
about 10 minutes, 800 ul of SOC medium is added and the cells 
are then incubated 1 hour at 37 »C with shaking and plated as 
above. 

' When screening for inserts into the beta-galactosidase 
gene in some of the standard vectors used, 200 ul of the 
recovered transformation mixture is plated on L3-agar plates 
containing 0.008% X-gal, 80 uM IPTG, and 100 ug/ml ampicillin 
(Sambrook et al.) . The plates are incubated at 37 • overnight 
to allow selection and growth of transf ormants . 
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Miniscreening DMA: 

Transf ormants from the selective media plates are grown 
and their plasmid structure is examined and confirmed using 
standard plasmid mini-screen procedures (Sambrook et al.) . 
Typically, the "boiling" procedure is used to produce small 
amounts of plasmid DNA for analysis (Sambrook et al.). 
Alternatively, an ammonium acetate procedure is used in some 
cases. This procedure is a modification of that reported by 
■Shing-yi Lee et al., Biotechnioues 9:676-679 (1990). 

1) Inoculate a single bacterial colony from the 
overnight selection plates into 5 ml (can be scaled down to 1 
ml) of TB (Sambrook et al.) medium and grow in the presence of 
the appropriate antibiotic. 

2) Incubate on a roller at 37 »C overnight. 

3) Collect 5 ml of bacterial cells in a plastic 
Oakridge tube and spin for 5 min. at 5000 rpm in a Sorvall 
SS-34 rotor at 4 # C. 

4) Remove the supernatant. 

5) Resuspend the pellet in 1 ml of lysis buffer (SO mM 
glucose, 25 mM Tris-HCl[pH 3.0], 10 mM EDTA and 5 mg/ml 
lysozyme) , vortex for 5 seconds, and incubate at room 
temperature for 5 min. 

6) Add 2 ml of freshly prepared alkaline solution (0.2 
N NaOH, 1% sodium dodecyi sulfate) , tightly secure lid, mix by 
inverting 5 times and place tube in an ice-water bath for 5 
min. 

7) Add 1.5 ml of ice-cold 7.5 M ammonium acetate (pH 
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7.6) to the solution, mix by inverting the tube gently 5 times 
and place on an ice-water bath for 5 min. 

8) Centrifuge mixture at 9000 rpm for 10 min. at room 
temperature. 

9) Transfer clear supernatant to a IS ml Corex tube and 
add 0.S volumes of isopropanol (approx. 2.5 ml). Let sit at 
room temperature for 10 min. 

10) Centrifuge the mixture at 9000 rpm for 10 min. at 
room temperature and discard the supernatant. 

11) Resuspend the pellet in 300 ul of TE buffer. Add S 
ul of a stock of RNase A & Tl (made as a 200 ul solution by 
adding 180 ul of RNase A [3254 Units/mg protein, 5.6 mg 
protein/ml] and 20 ul of RNase T1C481 Onits/ug protein, 1.2 mg 
protein/ml] ) . These stocks may be purchased from USB (US 
Biochemical) . Transfer to a microcentrifuge tube and incubate 
at 37 *C for 15 min. ,? 

12) Add 75 ul of distilled water and 100 ul of 7.5 M 
ammonium acetate and incubate in an ice-water bath for 10 min. 

13) Centrifuge the mixture at 14, 000 rpm for 10 min. in 
a BecJanan microfuge at 4 # C. 

14) Precipitate by adding 2.5 volumes of 100% BtOH 
(approx. 1 ml) and incubate in an ice-water bath for 10 min. 

15) Spin at 14, 000 rpm for 10 min. in a microfuge. 

16) Wash pellet with 70% ethanol (using 0.5 ml-1 ml) . 
Dry the pellet and resuspend in 100 ul of IX New England 
Biolabs restriction enzyme Buffer 4 [20 mM Tris-HCl (pH 7.9), 10 
mM magnesium acetate, 50 mM potassium acetate, 1 mM DTT] . 
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Measure concentration and check purity by spectrophotometry at 
absorbances 2S0 and 280 nm. 

For a more rapid determination as to whether or not a 
particular bacterial colony harbored a recombinant plasmid, a 
PC* miniscreen procedure is carried out using a modification of 
the method described by (Sandhu, G.S. et al., 1989, 
BioTechniques, 7:689-590). Briefly, the following mixture is 
prepared: 

100 ul primer mix above, 20 uM each primer, 
100 ul dNTP mix (2.5 mM each) 

100 ul 10X AmpliTaq buffer (PerJcin-Slmer Cetus, IX 
buffer = 10 mM Tris-HCl P H 8.2, 50 mM KC1, 1.5 mM MgC12, and 
0.01% gelatin) 

700 ul deionized water. 

20 ul of the above mixture is put into a a 0.5 ml 
polyproplyene PGR tube. A transformed bacterial colony is 
Picked with a toothpick and resuspended in the mixture. The 
tube is put in a boiling water bath for 10 minutes and then 
cooled to room temperature before adding 5 ul of the mix 
described below: 

265 ul deionized water 

30 ul 10X Amplitaq buffer (Perkin-Slmer Cetus, IX 
buffer - 10 mM Tris-HCl pH 8.2, 50 mM XCi, 1.5 mM MgC12, and 
0.01% gelatin) 

7.5 ul Taq polymerase 
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The samples are overlaid with 50 ul of mineral oil and 
PCS. is carried QUt for 30 cycles using the following 
parameters : 

denature: 94 • for 1 min 

anneal: 55 • for 1 min 

extend: 72 • for 45 seconds. 

After PCR amplification, 1 ul of loading dye (30% 
glycerol, 0.25% Bromopbenol blue, 0.25% xylene cyanol) is added 
to the whole reaction and 20 ul of the mixture is loaded on a 
2% Nusieve, 1% agarose gel to see if there is a ?CR product of 
the expected si2e. 

This procedure is used as an initial screen. Minipreps 
are subsequently carried out to confirm the structure of the 
plasmid and its insert prior to sequencing. 

SXAMPLS 2: AMPLIFICATION AND ASSEMBLY OF BACH QUARTER 
Cloning fragments of the synthetic Bt crylA(b) gene: 

The synthetic gene was designed to be cloned in four 
pieces, each roughly one quarter of the gene. The oligomers 
for each quarter were pooled to either be assembled by PCR, 
hybridization, or a combination of hybridization followed by 
PCR amplification as described elsewhere . Synthetic quarters 
were pieced together with overlapping restriction sites Aat II, 
Ncol, and Apa I between the 1st and 2nd, 2nd and 3rd, and 3rd 
and 4th quarters respectively. 

Each quarter of the gene (representing about 500 bp) 
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was assembled by hybridizing :fle appropriate oligoaers and 
amplifying the desired fragment using =ca primers specific -or 
the «ds of that quarter. Two different sets of Pca rsacsicns 

employing two sets of slightly di-' SJ -ont- 

aj-j-got-y ai^.erent primers were used. 

The PCS products of the two - 6ae r; nf , e 

.wo reactions were designed to be 

identical except that in the first reaction there was an 
additional ^ mce at ^ ^ gf ^ ^ ^ ^ 

- the second reaction there was an AGCT sequence at the 3' end 
of a given quarter. ^ ^ ^ ^ ^ 

• Particular quarter were .^ed .after retina the pol^ase. 
Primers and incomplete products, , d^atur*. and subsequently 
re-annealed, a certain ratio (theoretically 50%, of the 
annealed product should have non-homologous overhanging ends 
these ends were designed to correspond to the -sticsy ends- 
forced during restriction digestion with ScoRX at the 5- end 
and Hind in at the 3- end of the molecule. The resulting 
molecules were phosphorylated, lig«ed into an ZcoAI/Hindlll 
nested and phosphatase sluescript vector, and transform 
«o ta. competent t coli strain OHSalpha. «t.r 
Election, the caii colonies containing the dMirad fragment 
«e identified by restriction digest patterns of the DNA 
inserts representing parts of the synthetic gene are 
subsequently purified and sequenced using standard procedures. 

In all cases, clones from muitiala =rc roa( -^ B 

- ■ La reactions are generated 
and sequenced. The quart, rs lr9 ta . n ^ ^ ^ 

unique restriction sites at the junctions to obtain the 
complete gene. 
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Cloned quarters are identified by mini-screen 
procedures and the gene fragment sequenced. It is found tiiat 
errors are frequently introduced into the sequence, most 
probably during the PCR amplification steps. To correct such 
errors in clones that contain only a few such errors f 
hybridized oligomers are used. Hybridized fragments are 
digested at restriction enzyme recognition sites within the 
fragment and cloned to replace the mutated region in the 
synthetic gene. Hybridized fragments range from 90 bp in 
length (e.g. the region that replaces the fragment between the 
Sac II sites in the 2nd quarter) to the about 350 bp 4th 
quarter fragment that replaces two PCR induced mutations in the 
4th quarter of the gene. 

Due to the high error rate of PCR, a plasmid is 
designed and constructed which allows the selection of a cloned 
gene fragment that contains an open reading frame. This 
plasmid is designed in such a manner that if an open reading 
frame is introduced into the cloning sites, the transformed 
bacteria could grow in the presence of Jcanamycin. The 
construction of this vector is described in detail below. This 
selection system greatly expedites the progress by allowing one 
to rapidly identify clones with open reading frames without 
having to sequence a large number of independent clones. The 
synthetic quarters are assembled in various plasmids, including 
BSSX (Stratagene; La Jolla, Ca> , pOC18 (Sambrook et al.), and 
the Km-expression vector. Other suitable plasmids, including 
pOC based plasmids, are fcnown in the art and may also be used. 
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Complete sequence, of cloned fragments, wsze „ ^ 
or cloned ..„ e products, ^ bi0M?ays using 

corn borer as the test insect v, r if y :hat ; . Jily £unc ._ ional 
synthetic at crylAlb, o TO . s have been obtained. 

construction of the Xm-e^ession sector to select open reading 

frames : 

The Xm-expression vector is desired to select for fragments o f 
the synthetic gen. which contain op.n-r.adin, frames. Pea 
oligomers are designed which alio, the fusion =, the m g«. 
from Tn5 starting at nuciectide 13 (R . iss et al., smbo , 
3:33 17 -33« „„,„ ^ pOC;3 ^ ^ 

»tes between the DHA segments. The pointer region contains 
restriction sites to alio, cloning various synthetic Bt IP 

fragments in- frame with rha ~ 

»»e witn the xm gene. The 88 bp 5' oligomer 

containing the polylinter region is purified on a S% 
POlyac-ylamide ge! as described above for th. oligomer pass 
Purification. A PC* reaction is assembled with a i Kb Bgl 
msma I template fragment which contains th. HPT II gene 
=*rived from Tns. The PC* reaction mi, contains 100 n, of 
template with !00 pmols of oligomers KE72A28 and (see 
sequences below,. 200 m » d 2 . 5 Qf T3q 

all in a 50 ul volume wi-n 

oxume wz.uh an equal volume of mineral oil 

overlaid. Sequences of the primers are: 
XE74A28 

-GCAGATCTGS ATCCATGCAC GCC5TGAAGG GCCCTTCTAG AAG5CC7ATC 
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GATAAAGAGC TCCCCGGGGA TGGATTGCAC GCAGGTTC-3' 
KE72A28 

5 ' -GCGTTAACAT GTCGACTCAG AAGAACTCGT CAAGAAGGCG- 3 ' 

The ?CR parameters used are: 94 # C for 45 seconds (sec), 
55 # C for 45 sec, and 72 *C for 55 sec with, the extension at step 
3 for 3 sec for 20 cycles. All ?CR reactions are carried out 
in a Perkin-Slmer Cetus thermocycler. The amplified ?CR 
product is 800 bp and contains the polylinker region with a 
translational start site followed by unique restriction sites 
fused in-frame with the Km gene from base f 13 running through 
the translational terminator . pUC:KM74 is the Km-expression 
cassette that was assembled from the 800 bp Bgl II\Sal I 
polylinicer/Km fragment cloned in the PUC18 vector. The lacZ 
promoter allows the Xm gene to be expressed in coll . 
pOC:KM74 derivatives has to first be plated on L3-agar plates 
containing 100 ug/ml ampicillin to select transf ormants which 
can subsequently be screened on LB-agar plates containing 25 
jig/ml Jcanamy cin/ IPTG . Synthetic Bt IP gene fragments are 
assembled from each quarter in the Km-cassette to verify 
cloning of open-reading- frame containing fragments pieces. The 
first ECS active synthetic 3t IP gene fragment, pBt:Km#6, is a 
Bt I? gene that shows Km resistance. This fragment is 
subsequently discovered to contain mutations in the 3rd and 4th 
quarter which are later repaired. 
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EXAMPLE 2A: SYNTHESIS AND CLONING OF THE FIRST QUARTER OF THE 
SYNTHETIC GENE [base pairs 1 to 5501 

The following procedures are followed in order to clone 

the first quarter of the synthetic DNA sequence encoding' a 

synthetic Bt crylA(b) gene. The same procedures are 

essentially followed for synthesis and cloning of the other 

quarters, except as noted for primers and restriction sites. 

Template for Quarter 1; Mixture of equal amounts of purified 
oligomers U1-U7 and LI. to L7 

PCR Primers: 
Forward: 

PI (a): 5'-GTCGACAAGG ATCCAACAAT GG-3' 

PI <b) : S'-AATTGTCGAC AAGGATCCAA CAATGG-3' 



Reverse: 

P2 (a) : 5'-ACACGCTGAC GTCGCGCAGC ACG-3' 
?2 (b) : 5'-AGCTACACGC TGACGTCGCG CAG-3' 

Primer pair Al: PI (b) + ?2(a) 
Primer pair A2: PI (a) + P2(b) 

The PCR reaction containing the oligomers comprising 
the first quarter of the synthetic maize-optimi2ed 3t IP gene 
is set up as follows: 
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200 ng oligo mix (all oligos for the quarter mixed in 
equal amounts based on weight) 

10 ul of primer mix (1:1 mix of each at 20 uM; 
primers are described above) 

5 ul of 10X PCS. buffer 

PCS. buffer used may be either 

(a) IX concentration - 10 mM KC1, 10 mM (NH4)2S04, 20 
mM Tris-HCi, ?H 8.0, 2 mM MgS04 f and 0.1% Triton X-100), or 

(b) IX concentration = lOmM Tris-HCi pH 8.3, 50 mM KC1 
l.SmM MgCi 2 , 0.01% wt/vol gelatin. 

Components are mixed, heated in a boiling water bath 
for 5 minutes, and incubated at 65 »C for 10 minutes. 

Next, the following reagents are added: 

8 ul of dNTPs mixture (final concentration in the 
reaction = 0.2 mM each) 

5 units polymerase. 

The final reaction volume is 50 microliters. 

Oligomers are then incubated for 3 min at 72 # C and then 
a PCR cycle is run. The PCS. reaction is run in a Perkin Elmer 
thenno cycler on a step cycle protocol as follows: 

denaturation cycle : 94 • for 1 minute 

annealing cycle : 60* for 1 minute 

extension cycle : 72* for 45 seconds (+ 3 sec per 



cycle) 



number of cycles : 



15 
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After the reaction is complete, 10 ul of the PCR 
reaction is loaded on a 2% Nusieve-GTG (PMC) , 1% agaros 
analytical gel to monitor the reaction- The remaining 40 ul is 
used to clone the gene fragments as described below. 

PCR Products 

The termini of the double stranded PCR product 
corresponding to the various primer pairs are shown (only upper 
strand) : 

GCG7GT (554 bp) first qtr. 

GCGTGTAGCT (554 bp) first qtr. 

Hybridization 

40 ul of each of the PCR reactions described above is 
purified using a chromaspin 400 column (Clonetech, Palo Alto, 
CA) according to manufacturers directions. Five ug of carrier 
DNA was added to the reactions before loading on the column. 
(This is done for most of the cloning. However f in some 
reactions the PCR reactions are phenol : chloroform extracted 
using standard procedures (Sambrook et al.) to remove the Taq 
polymerase and the PCR generated DNA is recovered from the 
aqueous phase using a standard ethanol precipitation 
procedure.) The carrier DNA does not elute with the PCR 
generated fragments. The Al and A2 reaction counterparts for 
each quarter are mixed, heated in a boiling water bath for 10 



-Al AATTGTCGAC 
A2 GTCGAC 
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minutes and then incubated at 65 «C overnight. Th reactions 
are then removed from the 65- hath and ethanol precipitated 
with 1 ul (20 ug) of nuclease free glycogen (Tracy, Preo^ 
aiochem. 11:251-268 (1981) as carrier. The pellet is 
resuspended in 40 ul of deionized water. 

Phosphorylation reaction 

The phosphorylation reaction is carried out as follows: 

40 ul DNA 

2.5 ul 20 mM ATP 

0.5 ul 10X 3SA/DTT (IX = 5 mM DTT, 0.5 mg/ml 3SA) 
1.0 ul 10X polynucleotide Jcinase buffer (IX - 70 

mM Tris.HCi, 

pH 7.6, 0.1 M XCi, 10 mM MgC12) 

2.0 ul polynucleotide kinase (New England Biolabs, 

20 units) . 

Incubation is for 2 hours at 37 »c. 

The reaction is then extracted one time with a 1:1 
phenol: chloroform mixture, then once with chloroform and the 
aqueous phase ethanol precipitated using standard procedures. 
The pellet is resuspended in 10 ul of TE. 

Restriction Digests 

20 ug of Bluescript vector (BSSK+, Stratagene, La 

Jolla, CA) 

10 ul 10 X restriction buffer (IX - 20 mM Tris-HCl pH 
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8,0, 10 mM MgC12, 100 mM NaCI) 

5 ul Eco RI (New England 3iolabs) 100 units 

5 ul Hind III (New England Biolabs) 100 units 

Final reaction volume is 100 ul- 

Incubation is for 3 hours at 37 •. 
When completed, the reaction is extracted with an equal volume 
of phenol saturated with TE (10 mM Tris.HCl pH 8.0 and 10 mM 
EDTA) . After centrifugation f the aqueous phase was extracted 
with an equal volume of 1:1 mixture of (TE saturated) 
phenol: chloroform (the "chloroform" is mixed in a ratio of 24:1 
chloroform :isoamyl alcohol), and finally the aqueous phase from 
this extraction is extracted with an equal volume of 
chloroform. The final aqueous phase is ethanol precipitated 
(by adding 10 ul of 3 M sodium acetate and 250 ul of absolute 
ethanol f left at 4* for 10 min and centrifuged in a micro fuge 
at maximum speed for 10 minutes. The pellet is rinsed in 70% 
ethanol and dried at room temperature for 5-10 minutes and 
resuspended in 100 ul of 10 mM Tris.HCl (pH 8.3). 

Phosphatase reaction 

Vector DNA is routinely treated with phosphatase to 
reduce the number of colonies obtained without an insert. Calf 
intestinal alkaline phosphatase is typically used (Sambrook et 
ai.}, but other phosphatase enzymes can also be used for tiiis 
step. 

Typical phosphatase reactions are set up as below: 
90 ul of digested DNA described above 
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10 ul of 10X calf intestinal alkaline phosphatase 
buffer (1X-50 mM Tris-HCl (pB 3.3), 10 ndf MgC12, 1 mM ZnC12, 10 

mM spermidine) 

1 Ul (1 unit) of calf intestinal alkaline phosphatase 

(.CIS, Boehringer Mannheim, Indianapolis, IN) 

Incubation is at 37 «C for 1 hour. 

The DNA is then gel purified (on a 1% low gelling 
temperature (LGT) agarose gel) and the pellet resuspended in 50 
Ul TE. After electrophoresis, the appropriate band is excised 
from the gel using a razor blade, melted at 65* for 5 minutes 
and diluted 1:1 with TE. This solution is extracted twice with 
phenol, once with the above phenol : chloroform mixture, and once 
with chloroform. The final aqueous phase is ethanol 
precipitated and resuspended in TE buffer. 

Ligation: 

To ligate fragments of the synthetic gene into vectors, 
the following conditions are typically used. 

5 ul of phosphorylatad insert DNA 

2 ul of phosphatased sco RI/Hind III digested 
Bluescript vector heated at 65- for 5 minutes, then cooled 

1 ul 10X ligase buffer (IX buffer=30 mM Tris.HCl (pH 
7.8), 10 mM MgC12, 10 mM DTT, 1 mM ATP) 

1 ul 3SA (1 mg/ml) 

1 ul ligase (3 units, Promega, Madison, Misc.) 
Ligase reactions are typically incubated at 16* 
overnight or at room temperature for two hours. 
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Transformation ; 

Transformation of ligated DNA fragments into coli is 
performed using standard procedures (SambrooJc et al.) as 
described above. 
Identification of recombinants 

White or light blue colonies resulting from overnight 
incubation of transformation plates are selected. Plasmids in 
the transformants are characterized using standard mini -screen 
procedures (Sambrook et al.) or as described above. One of the 
three procedures listed below are typically employed: 

(1) boiling OKA minipre? method 

(2) PCR minis creen 

(3) Ammonium acetate miniprep. 

The restriction digest of recombinant plasmids believed 
to contain the first quarter is set up as follows: 

(a) 3am.SI/Aat II digest: 10 ul DNA + 10 ul IX New 
England Biolabs restriction enzyme Buffer 4 

0.5 ul Bam HI (10 units) 

0.5 ul Aat II (5 units) 

Incubation is for about 2 hours at 37 *C. 

Clones identified as having the desired restriction 
pattern are next digested with Pvu II and with Bgl II in 
separate reactions. Only clones with the desired restriction 
patterns with all three enzyme digestions are carried further 
for sequencing. 

Sequencing of cloned gene fragments: 
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Sequencing is performed using a modification of 
Sanger's dideoxy chain termination method (Sambrook et al.) 
using double stranded DNA with the Sequenase 2 kit (United 
States Biochemical Corp., Cleveland, OH). In all, six first 
quarter clones are sequenced. Of the clones sequenced, only 
two clones designated pQAl and pQAS are found to contain only 
one deletion each. These deletions are of one base pair each 
located at position 452 in pQAl and position 297 in pQA5. 

Plasmid pQAl is used with pPl-8 (as described below) to 
obtain a first quarter with the expected sequence. 



EXAMPLE 2B: SYNTHESIS AND CLONING OF THE SECOND 
QUARTER [base pairs 531 to 10501 

Template: oligomers U8-U14 and L8-L14 

PGR Primers: 

forward: 

P3 (a): S'-GCTGCGCSAC GTCAGCGTGT TCGG-3' 
P3 (b) : 5' -AATTGCTGCG CGACGTCAGC GTG-3' 



Reverse: 

P4 (a) : 5 7 -GGCGTTGCCC ATGGTGCCGT ACAGG-3' 
?4 (b) : 5' -AGCTGGCGT TGCCCATGGT GCCG-3' 

Primer pair 31: P3 (b) + ?4 (a) 
Primer pair 32: P3(a) + ?4 (b) 

PCR Products 
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Bl AATTGCTGCG AACGCC (524 bp) second 

quarter 

32 GCTGCG AACGCCAGCT (524 bp) 

Hybridization , PCR amplification r spin column size 
fractionation, and cloning of this gene fragment in Bluescript 
digested with Eco RI/Hind III are performed as described above 
for the first quarter (Example 2A) ♦ The PCR product for this 
quarter is about 529 bp in size representing the second quarter 
of the gene (nucleotides 531 to 1050) . Transformation is into 
frozen competent coli cells (DHSalpha) using standard 
procedures described above (SambrooJc et al.) 

Miniscreen of pQB clones: 

Miniprep DNA is prepared as described above and 
digested with (a) Aat II/Nco I, (b) Pvu II and (c) with Bgl I 
to confirm the structure insert in the vector before 
sequencing. 

Sequencing is performed as described above using the 
dideoxy method of Sanger (Sambrook et al.) . 

A total of thirteen clones for this quarter are 
sequenced. The second quarter consistently contains one or 
more deletions between position 884 and 887. In most cases the 
G at position 884 is deleted. 

Piasmid pQBS had only one deletion at position 884. 
This region lies between two Sac II sites (positions 859 and 
949) . Correction of this deletion is described in Example 3. 
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Clones of the firsr half (1-1050 bp) - 

A fragment for cloning the first half (quarters 1 and 
2) of the synthetic at mai2e gene as a single DNA fragment is 
obtained by restriction digestion of the product of a ?CR 
reaction comprising the first quarter and the second quarter. 
Restriction endonuclease Aat II is used to cut the DNA 
(following phenol extraction and ethanol precipitation), in a 20 
Ul reaction. 15 ul of each of the Aat II digested quarters is 
mixed and ligated (in a 50 ul volume by adding 5 ul of 10X 
ligase buffer, (1X=30 mM Tris-HCi ?H 7.8, 10 mM MgC12, 10 mM 
DTT, 1 ATE) 14 ul of deionized water and 1 ul of T4 DNA 
ligase, 3 units, Promega, Madison, WIJ at room temperature for 
2 hr. The result is an about 1 kb fragment as judged by 
electrophoresis on a 1% agarose gel run using standard 
conditions (Sambrook et al.) Ten ul of the ligation product is 
amplified by PGR using conditions described previously except 
that only 5 cycles were run. 

Primer Pair: HA- PI (a) + ?4(b) 
Primer Pair: HB- PI (b) + ?4 (a) 

The product of these reactions is cloned into 
31uescript (Stratagene, La Jolia, CA) as described for the 
individual quarters. This procedure is only done once i.e., 
all insert DNA is obtained in a particular region from a single 
PCR reaction. 

Thirty-six colonies are minis creened with Sal I digests 
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and Pvu II digests. All except 4 contain an insert of 
approximately 1 )cb in size of which at least 20 contain the 
correct Pvu II digestion pattern. Sight of these clones are 
selected for sequence analysis. One of the clones, Pl-3, has 
the desired sequence between the 3co NI site (396 bp) and the 
Dra III site (640 bp) . This clone is used to obtain a plasmid 
with the desired sequence up to the Dra III site (640 bp) in 
the second quarter with pQAl (first quarter with a deletion at 
position 452 bp described previously.) 



EXAMPLE 2C: CLONING AND SYNTHESIS OF THIRD 

QUARTER fbase oairs 1021 to 15001 

Template: Oligos U15-U20 and L15-L21 

PGR primers : 

forward 

?5 (a): 5'-TTCCCCCTGT ACGGCACCAT GGGCAACGCC GC-3' 
?5 (b) : 5 ' -AATTGTACGG CACCATGGGC AAC-3' 



reverse 

P6 (a): 5'-GAAGCCGGGG CCCTTCACCA CGCTGG-3' 
P6 (b) : S'-AGCTGAAGCC GGGGCCCTTC ACC-3' 

Primer pair CI: P5(b) + P6(a) 
Primer pair C2: ?5(a) + P6(b) 

PGR Product: 

CI AATTGTACGG GGCTTC (475 bp) 3d qtr* 
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C2 TTCCCCTGTACGG G GCTTCAGCT (484 bp) 3d qtir 



?C3. reacrions, spin column recovery of the correct 
sized DNA fragment, and ligation into vectors are performed as 
described above (Example 2A> using a Bluescript vector cut with 
Sco RX and Hind III. The approximately 479 base pair PCR 
product represents the third quarter of the synthetic gene (NT 

1021 - 1500) . 

Transformation into frozen competent coli strain 
DH5 alpha cells, selection and identification of transformants, 
characterization of transformants by mini-screen procedures, 
and sequencing of the synthetic gene fragment in the vector are 
all as described above. 

Mini screen of pQC clones: 

The third quarter is miniscreened using standard 
procedures (Sambroofc et al.J. Miniprep DNA is cut with (a) Nco 
I/Apa I and (b) with Pvu II. Clones containing the correct 
restriction digest patterns are sequenced using standard 
procedures. A total of 22 clones of the third quarter are 
sequenced. Three major deletion w hot3pots " in the third 
quarter are identified (a) at position 1083 (b) between 
position 1290-1297 and (c) between positions 1356-1362. In all 
clones except one r pQCS, there is also consistently an 
insertion of a C at position 1365. In addition to these 
mutations, the third quarter clones contain a large number of 
other apparently random deletions. The common factor to the 
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three mutational "hotspots" in the third quarter and tlx one in 
the second quarter is that thes regions are ail flanJced on 
either side by sequences that are about 80% C+G. Other regions 
containing 5 to 9 C-Gs in a row are not affected. The 
oligomers in 015, 016, 018, 019, LIS, L16, L18 and Li 9 are 
redesigned to reduce the C+G content in these regions. Five 
clones each from PCS reaction using the modified oligomers are 
sequenced. 

Plasmid pQCN103 has the correct sequence for the third 

quarter except for a change at position 1226. This change, 

which substitutes a G for a C, results in the substitution of 

one amino acid (leucine) for the original (phenylalanine) . 

EXAMPLE 2D: SYNTHESIS AND CLONING OF FOURTH 
QUARTER fbase oairs 1480 to 19601 

The fourth quarter of the gene is obtained from a clone 

which is originally designed to comprise the third and fourth 

quarters of the gene. The "second half" of the synthetic gene 

is obtained from PCR reactions to fuse the third and fourth 

quarters. These reactions are run with PCR primers P5 (a) and 

P6 (a) described above for the third quarter and primers P7 (a) 

and P8 (a) (described below) . The reverse primer- is modified to 

include a Sac I site and a termination codon. Separate 

reactions for each quarter are run for 30 cycles using the 

conditions described above. The two quarters are joined 

together by overlapping PCR and subsequently digested with 

restriction enzymes Nco I and Sac I. The resulting 953 bp 

fragment is cloned directionally into pCIB3054, which has been 
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cut with Nco I/Sac I and treated with alkaline phosphatase. 

pCIB3054 is constructed by inserting intron #9 of 
PEPcarhoxylase (PEPC ivs #9) in the unique Hpa I site of 
PCIB246 (described in detail in Example 4) ?CI3246 is cut with 
Hpal and phosphatased with CIP using standard procedures 
described in Example 2A. PEPC ivs #9 is obtained by PCS. using 
pPEP-10 as the template. pPEP-10 is a genomic subclone 
containing the entire maize PEP carboxylase gene encoding the 
C 4 photosynthetic enzyme, plus about 2.2 Kb of 5' -flanking and 
1.8 Kb of 3' -flanking DNA. The 10Kb DNA is ligated in the 
Hindlll site of pOClS. (Hudspeth et al., Plant Molecular 
Biolooy, 12: 576-589 (1989) . The forward PCS. primer used to 
obtain the ?EPCivs#9 is GTACAAAAACC3GCAACTC and the reverse 
primer is C7GCACAAAGTGGAGTAG7 . The PCX product is a 108 bp 
fragment containing only the PEPcarhoxylase intron #9 
sequences. The PCS. reaction is extracted with phenol 
and chloroform, ethanol precipitated phosphorylated with 
polynucleotide kinase and treated with T4 polymerase to fill in 
the 3' nontemplated base addition found in PCS. products (Clark, 
j.M., Mncleic Acid Research , 16: 9677-9686 (1988)) using 
standard procedures. The kinased fragment is blunt-end cloned 
into the Hpal site of pCIB246, using standard procedures 
described earlier. 

Amplification and Assembly of the Fourth Quarter 
Template: U2 1-025 and L22-L28 
PCS, arimers 
FORWARD 
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P7 (a) : 5' -TGGTGAAGGG CCCCGGCTTC ACCGG-3' 
REVERSE 

?8 (a) : 5'-ATCATCGATG AGCTCCTACA CCTGATCGAT GTGGTA-3' 
PRIMER PAIR 4: P7(a) + ?8 (a) 
PRIMER PAIR 3: PS (A) + P6(a) 

Primer pair for overlapping PCR : P7(a) + P8 (a) 
PCR Product 

fourth quarter: GGTGAA ATCAGGAGCTCATCGATGAT 

(484 bp) third quarter: TTCCCCCTGTA ■ TTCACCGG 

(484 bp) second half: GGTGAA CATGATGAT (953 bp) 

Four positive clones are identified by pla sm id 
miniscreen and are subsequently sequenced using sta n da r d 
procedures. 

Plasmid Bt.P2 #1 contains approximately the correct 
fourth quarter sequence except for two mutations. These 
mutations are at position 1523 (substituting an A for a G, 
resulting in an amino acid change which substitutes a His for 
an Arg) and at position 1S34 (substituting a T for a C, 
resulting in an amino acid substitution of a Ser for a Thr) . 

Plasmid Bt.P2 #1 is used in the construction of 
PCI54414 described below. (The mistakes are ultimately 
corrected by hybridizing all the oligos of the fourth quarter, 
digesting with Apa I/Bst E II and replacing that region in 
pCIB4414. Therefore, only sequences from position 1842-1960 
remain of Bt-P2*l in the final construct.) 
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««MPCB 3: ASSW-MT AND 3EP> ™ OF THE ?INAL SYNTHETIC GENE 

The synthetic maize optimized Bt crylA(b) gene is 
designed to be cloned in quarters. Using the PCR technique, 
however, results in mutations, which in most cases are 
deletions resulting in framesfaift mutations. Plasmids 
containing individual quarters are therefore sequenced and the 
correct parts ligated together using standard procedures. 

After obtaining first and second quarter clones with 
almost the desired sequence, plasmids ?EB1Q#4 and ?EB1Q#5 are 
constructed to obtain the desired sequence of the synthetic Bt 
gene up to the Dra III site at the base pair position 534 (this 
mutation destroys the Dra III site) . The pEBlQ constructs are 
made by ligating a 3.9 Kb Eco NI\Bam HI fragment from pPl-8 
with a 400 bp fragment from pQAl. ?EB1Q#5 has the desired 
sequence up to the Dra III site, but ?EB1Q#4 has a mutation at 
base pair position 378. 

Plasmids ?1H1M4 and plHlMS are constructed to repair 
the Dra III site in P EB1Q#4 and pEBlQ#5. Plasmids ?1H1M#4 and 
#5 are made by ligating a 3.5 Kb Nco I\Aat II fragment from 
pEBlQM and #5 respectively, with a 500 bp Nco I\Aat II 
fragment from pQB5. Plasmid plHlM5 contains a mutation between 
the Sac II sites at position 884 in the second quarter of the 
synthetic Bt gene. Plasmid plHlM4 contains the additional 
mutation as described in its precursor construct ?EB1Q#4. 

The Sac II site in the Bluescript vector region of 
P1H1M4 is deleted by cutting ?1H1M4 with Not I and sac I and 
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converting these sites to blunt ends using T4 DMA polymerase 
under standard conditions before ligating this 3.9 Kb fragment 
to make ?1H1M4*S. Deleting the Sac II site in the vector 
region allows the 90 bp Sac II fragment with the mutation at 
position 384 in the 2nd quarter of plHlM4*S to be removed prior 
to replacement with a 90 bp Sac II fragment. Oligomers 0\L 12 
and 13 are kinased and hybridized (described above) before 
cutting with Sac II and isolating a 90 bp fragment on a 2% 
Nusieve gel. The Sac II fragment is ligated into the about 3.8 
Kb Sac II cut plHlM4*S vector which has been phosphatased with 
CIP. The repaired Sac II construct is called pHYB2#6. The 
orientation of the Sac II fragment in pHYB2#6 is detected by 
PCR screening as described earlier using the following primers : 

MK23A28 = 5' -GGGGCTGCGGATGCTGCCCT-3' 
MK25A28 = 5' -GAGCTGACCCTGACCGTGCT - 3 ' 
MK26A28 - 5' -GACC7GATGGACATCC7G&A-3' 

Running the PGR reactions with 50 pmoles of primers 
MK23A28 and MK25A28 produces an approximate 180 bp fragment, 
indicating the inserted fragment bounded by the Sac II sites in 
?HYB2#6 is in the correct orientation. Using primers MK25A28 
and MK26A28 in the PCR screening acts as the negative control 
producing an approximate 180 bp fragment only in constructs 
containing the Sac II bounded fragment in the wrong 
orientation. pHYB2#6 sequence is determined using standard 
procedures. 
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P HY32#6 has one mutation at position 378 which needed 
to be repaired to obtain a first quarter containing the desired 
sequence . 

Plasmid plHG#6 contains the desired sequence for the 
entire first half of the synthetic Bt gene. ?1HG#6 is made 
from a 3.4 Kb Aat IINHco I fragment of plHlM5#2 ligated to a 
500 bp Aat II\Nco I fragment from ?HYB2#6. 

To identify clones or partial clones of the synthetic 
gene which contain open reading frames, the Jcanamycin selection 
vector, (described above) is used. The fourth quarter of the 
synthetic 3t gene is the first put into the Jcanamycin cassette. 
PKM74-4 contains the approximately 500 bp Apa I\Cla I fragment 
from plasmid 3t22 (which had been previously transformed into a 
dam- 2. coli strain (PO-100) to be able to cut with Cla I), 
ligated to ?DC:XM74 cut with Apa I\Cla I. Plasmid ?KM74-4 
displays Jcanamycin resistance but is later found to contain two 
substitution mutations at positions 1523 and 1634 (mutations 
are described above in the section on cloning the fourth 
quarter; they are substitutions, not deletions or insertions) . 

The correct first half of the synthetic Bt gene from 
plasmid plHG#6 is inserted into plasmid pKM74-4 . The resulting 
plasmid, called pKml24, is made from the about 3.9 Kb Apa I\Bam 
HI fragment derived from pKM74-4 ligated to 1 Kb Apa I\Bam HI 
fragment from ?1HG*6. pKml24 shows Jcanamycin resistance. This 
plasmid contains the first, second, and fourth quarters of the 
synthetic gene forming a single open reading frame. 

The third quarter of the synthetic gene is next cloned 

70 



WO 93/07278 



PCT/US92/G8476 



into pKM124, The firsc functional clone, in piasmid pBt:Km#6, 
is a functional copy of the truncated synthetic crylA(b) gene 
in the Km-cassette which displays Jcanamycin resistance but 
which contains deletion mutations between the third and fourth 
quarters. Piasmid pBt :Xm#6 is made from the approximately 5 Kb 
Apa I\Nco I ?KM124 vector fragment ligated to the approximately 
500 bp Apa I\Nco I fragment from pQCN103 (pQCN103 contains a 
mismatch mutation at position 1326 which is repaired later) . 
Contaminating nuclease activity appears to have deleted the Apa 
I site between the third and fourth quarters in pBt:Km#6. The 
Bt gene encoded by the synthetic gene in piasmid pBT:Xm#6 has 
about 50-60 % of the native proteins' activity against EC3. 
The 2 Kb Sma I\Bam HZ fragment from pBt:Km#6 is inserted into a 
35S: expression cassette to make a piasmid called 35S:3t6. 

Two functional synthetic Bt clones, each with 
mutations, are initially obtained: plasmids pBT:KM#6 and 
PCI34414. pCZB4414, which is 100% active in insect bioassays 
against European corn borer compared with the native gene, 
contains substitution mutations in the third and fourth 
quarters at positions 1323, 1523, and 1534. 

pCIB4414 is constructed from two plasmids, MG3.G4#18 
and 1HG which is described above. MG3.G4#18 is obtained by 
cloning the Apa I/Kpn I fragment in piasmid Bt.P2#l into 
PQCN103 (using those same restriction sites) . This produces a 
piasmid containing the third and fourth quarters of the gene. 
The first half of the synthetic gene from piasmid 1HG is cut 
with Bam HI and Nco I and moved into MG3.G4#18 (containing the 
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third and fourth quarters of the gene) . The resulting plasmid, 
PCIB4414, contains a functional version of the synthetic gene. 
While being functional, the synthetic gene in this plasmid 
contains three errors; position 1326 (G substituted for a C) , 
position 1523 (substitute A for a G) , and at position 1634 
(substitution of a T for a C) . 

The fourth quarter in pCI34414 is replaced with a 354 
bp fourth quarter Apa I\Bst 3 II fragment obtained from 
hybridizing, ligating, and restriction cleaving fourth quarter 
oligomers as described earlier, and isolating the fragment from 
a 2% Nusieve agarose gel. pCIB4408 is a synthetic Bt gene 
clone obtained by replacing the fourth quarter fragment in 
PCIB4414 with the hybridized fourth quarter fragment. To 
insert the CaMV 35S promoter in front of the synthetic Bt gene, 
PC1B4406 is made from a 4 Kb 3co HI\Kpn I fragment from plasmid 
p35SBtS and 1.8 Kb Eco NI\Kpn I fragment from pCI34408. 

pCIB4406 is 100% active (as compared with the protein 
from the native gene) against BC3 but contains the substitution 
mutation in the third quarter of the synthetic gene at position 
1323 resulting in an amino acid substitution of a leucine for a 
phenylalanine. Plasmid pBS123#13 is used to repair this 
mutation . 

The third quarter fragment in plasmid pBS123#13 is made 
from an approximately 479 bp hybridized oligomer generated 
fragment. Third quarter oligomers U15-O20 and L15-L21 are 
kinased, hybridized, and ligated as described above. PCR 
reactions are carried out as described above with primers ?5 (a) 
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and ?6<b) for IS cycles. The PCR product is treated with 
proteinase X at a final concentration of ab ut SO ug\ml in an 
approximate 95 ui volume for 30 minutes at 37 # C followed by 10 
minutes at 65 # C (Crowe et al., Nucleic Acid Research 19: 184 , 
1991,) Subsequently, the product is phenol \ chloroform extracted 
and ethanol precipitated using standard procedures before 
cutting with restriction enzymes Apa I and Nco I. 

The approximate 450 bp Apa I\Nco I ?CR fragment is 
ligated to the 3.8 Kb Apa I\Nco I vector fragment from plHG#6 
to make pBS123#13. Plasmid pBS123#13 contains the desired 
sequence for the third quarter of the maize optimized crylA(b) 
gene from position 1319 at the Nsp I site through the Apa I 
site at position 1493. This 170 bp Nsp I\Apa I fragment from 
pBS123*13 is used in the fully active synthetic crylA(b) gene 
in plasmid pCI34418. 

Western 31ot Analysis: 

Western blot analyses of various transf ormants are 
performed using crude extracts obtained from 3^ coli grown on 
selective plates. Using a toothpick, cultures are scraped from 
fresh plates containing the transf ormants of interest which 
have been grown overnight at 37 •c. The positive control for 
expression of the Bt gene in coli was a construct called 
?CI33069 which contains the native 3t-k gene fused with the 
plant expressible CaMV 35S promoter. ?CI33069 also contains 
the 35S promoter operably linked to the hygromycin resistance 
gene, 35S promoter, with Adh intron #1 operably linked to. the 
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CTS gene, and 35S promoter operably linked to a gene coding for 
tie production of the native 3t crylA(b) IP. A negative 
control of Sj, coli which does not contain a 3t gene is also 
included in the analyses. Cultures are resuspended in 100 ul 
of loading buffer containing 62 mM Tris-HCl pH 5.8, 1% SDS, 
0.0025% bromophenol blue, 10% glycerol and 7.5% 
mercaptoethanol. After heating the mixtures at 95 »C for 10 
minutes, the preparations are sonicated for 1-3 seconds. The 
debris is centrifuged in a microfuge at room temperature for 
about 5 minutes and 10 to 15 ul of each sample is loaded onto 
an acrylamide gel with a 10% running gel below a 5% stacking 
gel (Laemmli, Nature 227; S30-S85 (1970) ) . After electrophoresis 
overnight at 10 mAmps, proteins are transferred from the gel to 
an Immobilon membrane (Miilipore) . The transfer is done using 
an electrophoretic Blotting Unit (American BioNuciear, 
Emeryville, CA) in transfer buffer (20 mM Tris, 150 mM glycine, 
and 20% methanol) for 1.5 hours at 450 mAmps. 
Buffers for western blotting included: 
Blocking buffer: 2% Tween-20 

30 mM Tris-HCl pH 10.2 
150 mM NaCl 

Wash buffer: 0.0S% Tween-20 

30 mM Tris-HCl ?H 10.2 
150 mM NaCl 

Developing buffer: 100 mM Trisr-HCl pH 9.5 
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100 mM NaCI 
10 mM MgC12 

Aftar transfer is complete, the membrane is incubated 
for about ten minutes in the blocking buffer. Three IS minute 
washes with wash buffer are done before the first antibody 
treatment. The first antibody is an immunoaf f inity purified 
rabbit or goat antibody prepared using the CrylA(b) protein as 
the antigen (Ciba-Geigy, RTP, N.C.; Rockland Inc., 
Gilbert sville, PA. ; and 3erkeiey Antibody CO. , Richmond, CA.). 
The crylA(b) specific antibody is treated immediately before 
use with coli lysate from 3io-Rad in a 1 ml volume with 5 ug 
of antibody, 50 ul of S. coli lysate in the wash buffer 
solution. This mixture is incubated for 1 hour at room 
temperature before diluting it 1 to 30 for a final dilution of 
1:6000 with wash buffer. Incubation of the membrane with the 
first antibody is at room temperature for 1.5 hours. 

Three 10 minute washes are done between the 1st and 2nd 
antibody treatments . The second antibody is either rabbit 
anti-goat or goat anti- rabbit /alkaline phosphatase conjugate 
(Sigma, St. Louis, MO.). Incubation with the alkaline 
phosphatase conjugate is carried out at room temperature for 
one hour using a 1 to 6000 dilution in wash buffer. Six 10 
minute washes are done between the second antibody 
treatment and developing the western blot. The western blot is 
developed in 100 ml of developing buffer with 440 ]xl of 
nitroblue tetrazolium in 70% dimethyl formamide (75 mg\ml) , and 
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330 ul of 5-brom -4-chloro-indolyl-?hosphate in 100% dimethyl 
fonnamide (SO mg\mi) . After developing for 15 co 30 minutes, 
the membrane is washed in water and air dried. 

SXAMFLS 4: CONSTRUCTION OF TRANSFORMATION VECTORS 
Construction of pCI3710 and derivatives. 

CaMV 3SS Promoter Cassette Plasmids ?CI3709 and pCI3710 
are constructed as shown in Rothstein et al., Gene .53 : 153-151 
(1987) . pCIB710 contains CaMV promoter and transcription 
termination sequences for the 35S RNA transcript [Covey et al., 
Ntiel. Acids. Res., 9:6735-6747 (1981)]. A 1149 bp Bglll 
restriction fragment of CaMV DNA [bp 6494-7643 in Hohn et al., 
Current Topics in Microbiology and Immunology/ 96:194-220 and 
Appendices A to G (1982) ] is isolated from CaMV DNA by 
preparative agarose gel electrophoresis as described earlier 
The fragment is mixed with 3amHI -cleaved plasmid pOC19 DNA f 
treated with T4 DNA ligase f and transformed into coli. 
(Note the BamHI restriction site in the resulting plasmid is 
destroyed by ligation of the Bglll cohesive ends to the BamHI 
cohesive ends.) 

The resulting plasmid, called pOC19/35S r is then used 
in oligonucleotide-directed in-vitro mutagenesis to insert the 
3amHI recognition sequence GGATCC immediately following CaMV 
nucleotide 7483 in the Hohn reference. The resulting plasmid, 
pCIB710 r contains the CaMV 35S promoter region and 
transcription termination region separated by a 3amHI 
restriction site. DNA sequences inserted into this BamHI site 
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will be expressed in plants by these CaMV transcription 
regulation sequences. (Also note that pCIB710 does not contain 
any ATG translation initiation codons between tiie start of 
transcription and tie BamHI site) . 

PCIB710 is modified to produce pCIB709 by inserting a 
Bam HI fragment containing the coding sequence for hygromycin 
phosphotransferase from pLGSO [Rothstein et al., Gene, 
53:153-161 (1987)] in the 3am HI site. 

PCIB709 is modified to produce pCIB996 by removing the 
ATG just upstream from the initiation codon of the hygromycin 
phosphotranserase gene using standard mutagenesis techniques 
while inserting a 3gl II restriction site at this location. 
The resulting plasmid, pCIB996 f is further modified to remove 
the Bam HI, Sma I and Bgl II sites in the 5 ' untranslated 
leader region located 5' of the initiation codon for the 
initiation codon. The result is a change of DNA base sequence 
from -TATAAGGATC CCGGGGGCA AGATCTGAGA TATG-Hyg to -TATAAGGATC 
TGAGATATG-Hyg. The resulting plasmid is icnown as pCIB3073. 

Alternatively, pCIB710 is modified to produce pCIB900, 
by inserting the Bam HI - 3d I fragment of pCIB10/35SBt, which 
contains the 645 amino acid 3t coding sequence, described in 
Part C4 below, into the 3am HI site of pCIB710 to create 
pCIB710/35SBt. To introduce an antibiotic resistance marker, 
PCI3709 is cut with Sal I, a Kpn I/Sal I adaptor is iigated and 
the resulting ligation product is cut with Kpn I. The Kpn 
fragment of pCIB709 containing the 35S /hygromycin resistance 
gene is inserted into the Kpn I site of pCIB710/35SBt to 
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produce ?CI3900. 

Genes useful as the selectable marker gene include the 
hygromycin resistance gene described in Rothstein et al., Gene 
53: 1S3-1S1 (1987) . The hygromycin gene described in this 
reference is moved into a pOC plasmid such as ?CI3710 or 
?CIB709 and the "extra" ATG upstream from the hygromycin 
phosphotransferase coding sequence is removed to create 
PCIB996. This modified pCI3996 gene is further modified to 
remove a Bglll, Bamfll and Smal sites from the 5' region of the 
gene using standard techniques of molecular biology to maxe 
PCI33073 . 

pCIB932 is a pOC19-based plasmid containing the 
chimeric gene Pep-C:promoter\Bt\Pep-C: terminator. It is 
composed of fragments derived from pPEP-10, a Hindi "I subclone 
of a genomic clone, Hl-Iambda-14, ?MAS OSA , 83:2884-2888 
(1986) , of the maize gene encoding the PEP carboxylase enzyme 
active in photosynthesis, and from pCXB930, which is a 3amHI 
fragment containing the S45 amino acid truncated form of the 
the crylAb endotoxin gene in the 3amHI site of pOC18. 

The 2.5 kb EcoRI-XhoI fragment from pPEP-10, containing 
the polyA addition site from the PSP carboxylase gene, is 
isolated and digested with PstI and Hindi. The restriction 
digest is ligated with Pstl/HincII digested pOC18, transformed 
into E;. coli and transf ormants screened for those containing a 
412 bp Pstl-HincII insert in pUC18 and the insert verified by 
sequencing. The resulting plasmid is called pCIB931. 

The nuclear gene encoding the phosphoenolpyruvate 
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carboxylase isozyme ("Pep-C") is described in Hudspeth et al., 
Plant Molecular 3iolocrv , 12: 579-589 (1989) . ?CIB932 is 
constructed by the ligation of three fragments. The first 
fragment, containing the PEP-C transcription terminator, is 
produced by digesting ?CIB931 to completion with Hindi II, 
partially with Sphl and the 3098 bp fragment isolated. The 
second fragment, containing the at endotoxin coding sequence, 
is produced by digesting pCIB930 with Ncol and Sphl and 
isolating the 19S0 bp fragment. The third fragment, containing 
the PEP-C promoter, is produced by digesting pPEP-10 to 
completion with Hindlll, partially with Ncol and isolating the 
2.3 Jcb fragment. The ligation mix is transformed into 3^ coli , 
transf ormants with the correct insertion identified and the 
insert verified by sequencing. 

PCIB932 is cut with PvuII to generate a 4.9 Kb fragment 
containing the mai2e Pep-C:promoter\Bt\Pep-C: terminator and 
purified on a 1% LG7 agarose gel in IX TAE. The linearized 
PCI33079 vector and the 4.9 Kb insert from pCIB932 are iigated 
using T4 DNA ligase in LGT to make pCIB4401. pCI34401 is a 
maize transformation vector containing the chimeric genes: 
35S : promoter \PAT\ 35S : terminator , Pep-C : promoter \Bt\Pep-C : 
terminator, and 35S:promoter\AdhI #1 intron\GUS\35S : 
terminator.. 

Construction of pCIB246 (35S-GUS-35S) 

A CaMV 35S promoter cassette, pCIB246, is constructed 
as follows. 

The Ddel restriction site at nucleotide position 7482 
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of rue CaM<r genome [Franc* et al., Call, 21:285-294 (1980)] is 
modified by insertion of a 48 bp oligonucleotide containing 
several restriction enzyme sites including an Ncol (CCATGG) 
site, a Sail (GTCGAC) site, and an SstI (GAGCTC) site. This 
altered CaMV 35S promoter is inserted into a pUC19 vector that 
had been modified to destroy the vector's SstI and Sail sites. 
Thus, the CaMV 355 promoter of pCIB1500 contains unique SstI 
and Sail sites for cloning. 

pCXB1500 is digested with Sstl/Ncol and ligated with 
the GUS gene obtained from pBI221 (Clontech Laboratories, Inc., 
Palo Alto, CA) . The Ncol site is fused to the GUS gene such 
that the ATG of the Ncol site functions as the start codon for 
the translation of the GUS gene. The CaMV 35S polyadenylation 
and termination signals are used for the 3' end of the chimeric 
gene . 

Construction of pCI33069 <35S-Adhl-GUS-35S) 

pCI3246 is modified by adding the mai2e alcohol 
dehydrogenase gene Adhl intron number 1 (Adhl) (Dennis et al., 
Nucleic Acids Research , 12:3983-4000 (1984)) into the Sal I 
site of pCIB246 to produce plasmid pCIB3007. The Adhl intron 
is excised from the maize Adhl gene as a Bal I/Pst I fragment 
and subcloned into pOC18 that was cut with Sma I/Pst I to make 
a plasmid called Adh 1026. Adh 102S is cut with ?vu Il/Sac II, 
the fragments are made blunt ended with T4 DNA polymerase, Sal 
I linkers are added using standard procedures and a fragment of 
about 560 bp is recovered from a 3 % NuSeive gel and ligated 
into Sal I cut /phosphatase treated pOC18. The Sal I linkered 
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Adh intron #1 in the resulting plasmid is cut our with Sal I, 
gel purified, and ligated into Sal I cut/ phosphatase treated 
pCIB246 to make plasmid pCI33007. 

PCIB3007 is cut with PstI and the ends made blunt by 
using T4 DNA polymerase (NEW Sngland 3iolabs) according to the 
suppliers' specifications. The resulting blunt ended molecules 
are cut with Sph I and the approximately 5.8 Xb fragment with 
one blunt end and one Sph I end is purified on a low gelling 
temperature (LGT) agarose gel using standard procedures. 
PCI3900 is cut with Sma I /Sph I and the fragment containing the 
25S/Bt gene is purified on a LGT agarose gel. The two gel 
purified fragments are ligated in LGT agarose using T4 DNA 
iigase according to standard conditions. The resulting ligated 
fragments are transformed into coli using standard 
procedures and the resulting plasmid is called ?CIB3062. There 
are two versions of pCIB3062. pCIB3062#l has a Sma I site 
regenerated where the Sma I site and the T4 polymerase blunted 
ends are ligated. This most likely results from the T4 
polymerase nibbling a few base pairs from the Pst I site during 
the blunting reaction. pCIB3062#3 does not have this Smal 
site. 

pCI33062#3 is cut with Kpnl and made blunt-ended using 
T4 DNA polymerase, and subsequently cut with ?vu II to yield a 
S.4 Kb fragment with blunt ends containing the 35S/GOS and 
35S/Bt genes. This blunt-end fragment is ligated into Sma I 
cut pCIB3073 to produce pCI33063 or pCIB3069. pCI33069 
contains the same fragment used to make pCIB3063, but the 
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chimeric genes in pCIB3069 are all in the sane relative 
orientation, unlike those in ?CI33063. These plasmids contain 
a) a 35S promoter operably linked to the hygromycin resistance 
gene; b) a 35S promoter, with Adh intron #1, operably linked to 
the GUS gene; and c) a 35S promoter operably linked to a gene 
coding for the production of the synthetic crylA(b) 
insecticidal protein from Bacillus thurinoiensis , as described 
above . 
GUS Assays: 

GUS assays are done essentially as described in 
Jefferson, Plant Mol. 3io. Reporter , 5:387-405 (1987). As 
shown above, plasmid pCI3246 contains a CaMV 35S promoter fused 
with the GUS gene. The 5' untranslated leader of this chimeric 
gene contains a copy of the maize Adhl intron #1. It is used 
here as a transformation control. Although the same amount of 
PCIS246 is added to each transformation, the calculated 
activity varied among Bt constructs tested. The values 
reported below are averages of 3 replicates. pCIB4407 was 
tested twice. 

pCXB3069 28 nM MU/ug/min 

PCIB4407 0.7 nM MU/ug/min, 2.3 nM MU/ug/min 

EXAMPLE 5A: ASSAY OF SYNTHETIC cryLA(b) gene FOR INSECTICIDAL 
ACTIVITY AGAINST EUROPEAN CORN SORER ■ 

The synthetic cryLA(b) gene in pCIB4414 in S. coli.is 

assayed for insecticidal activity against European corn borer 
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according to the following protocol. 

Molten artif ical insect diet is poured into a SO arm 
Gellman snap-cap petri dish. After solidification, E. coli 
cells, suspended in 0.1% Triton X-100, are- spread over the 
surface at a concentration of 3 X 107 cells/cax2. The plates 
are air dried. Ten first instar European corn borer, Ostrinia 
nubilaiis , which are less than 12 hours old are then placed 
onto the diet surface. The test is incubated at 30 C in 
complete darkness for 2-5 days. At the end of the test percent 
mortality is recorded. A positive clone has been defined as 
one giving 50% or higher mortality when control E. coli cells 
give 0-10% background mortality. 

For comparison, the native crylA(b) gene in pCIB3069 is 
tested at the same concentration. Clones are tested at 3 x 10 7 
cells /cm* - diet; 20 insects per clone. 

The following results are observed; 



Clone Percent Mortality 

Control o 
PCIB3069 100 
PCIB4414 100 

These results indicate that the insecticidal crystal 

protein produced by the synthetic cry LA (b) gene demonstrates 

activity against European com borer comparable to that of the 

IP produced by the native crylA(b) . Other plasmids containing 

a synthetic crylA(b) gene were assayed in a similar manner. 

EXAMPLE SAB: ASSAY OF CRYIA(b) PROTEIN FOR INSECTICIDAL 
ACTIVITY AGAINST SUGARCANE BORER. 

Cry LA (b) was expressed in coli and assayed for 
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insect:! cirial activity against Sugarcane borer (Diatrea 
saccharalis ) according to the same protocol used for European 
corn borer, described immediately above. The results are 
summarized in the Table. 

TABLE 

SUGARCANE 30RBR ASSAY WITH 3t PROTEIN FROM COLI 



Protein 


Percent 


Concentration 


Mortality 


(ng/g) 


CrylA(b) 


10 


0 


25 


0 


50 


7 


100 


13 


250 


40 


500 


53 


1000 


80 


LCSO 


380 


95% Cl 


249-S46 



The results indicate that the insecticidal protein 
produced by a maize optimized 3t gene is effective against 
Sugarcane borer* The upper concentrations of CrylA(b) protein, 
250 ng/g-1000 ng/g / are achievable in transgenic maize plants 
produced in accordance with the instant invention. 



EXAMPLE 6: MAIZE PROTOPLAST ISOLATION AND TRANSFORMATION WITH 
THE SYNTHETIC 3T GENE 

Expression of the synthetic 3t gene is assayed in 

transiently transformed maize protoplasts. 



Protoplast Isolation Procedure: 

1. The contents of 10 two day old maize 2717 Line S 
suspension cultures are pipetted into 50 ml sterile tubes and 
allowed to settle. All culture media is then removed and 
discarded. 



84 



WO 93/07278 



PCT/US92/08476 



2. Cells (3-5 mi Packed Cell Volume) are resuspended 
in 30 ml protoplast enzyme solution. Recipe follows: 

3% Callulase RS 

1% Macerozyme RIO in KMC Buffer 
KMC 3uffer (recipe for 1 liter) 
KCi 8.65 g 

MgCl 2 -6H 2 0 16.47 g 

CaCl 2 -2H 2 0 12.50 g 

MES 5.0 g 

pH 5.6, filter sterilize 

3. Mix cells well and aliquot into 100x25 mm petri 
dishes, about 15 ml per plate. Shake on a gyratory shaicer for 4 
hours to digest. 

4. Pipette 10 ml KMC through each 100 micron sieve to 
he used. Filter contents of dishes through sieve. Wash sieve 
with an equal volume KMC. 

5. Pipette sieved protoplasts carefully into 50 ml 
tubes and spin in a Beckman TJ-6 centrifuge for 10 minutes at 
1000 rpm (500 x g) . 

6. Remove supernatant and resuspend pellet carefully in 
10 ml KMC. Combine contents of 3 tubes into one and bring 
volume to 50 ml with KMC. 

7. Spin and wash again by repeating the above step. 
3. Resuspend all washed protoplasts in 50 ml KMC. 

Count in a hemocytometer . Spin protoplasts and resuspend at 3 
x 10 6 /ml in resuspending buffer (RS Buffer) . 
RS Buffer (recipe for 500 ml) 
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mannitol 27.33 g 

CaCl 2 (0.1 M stock) 75 ml 
MES 0.5 g 

pH 5.3, filter sterilize 
Protoplast Transformation Procedure: 

1. Aliquot 50 ug plasmid DNA (Bt IP constructs, botii 
synthetic (pCIB4407) and native (pCIB3069) ) to 15 ml 
polystyrene culture tubes. Also aliquot 25 ug GUS -containing 
plasmid DNA (which does not contain Bt IP (pCXB246) to all 
tubes. 3 replications are used per construct to be tested, 
with 1 rep containing no DNA as a control. 

3t constructs: GUS construct: 

PCIB3069 PCIB246 
pCIB4407 

2. Gently mix protoplasts well and aliquot 0.5 ml per 

tube . 

3. Add 0.5 ml PEG-40 to each tube. 

PEG-40: 

0.4 M mannitol 

0.1 M Ca(N0 3 J 2 -4H 2 0 

pH 8.0, filter sterilize 

4. Mix gently to combine protoplasts with PEG. Wait 30 

minutes . 

5. Sequentially add 1 ml, 2 ml, and 5 ml W5 solution at 
5 minute intervals. 

W5 Solution: 
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154 mM NaCi 

125 mM CaCl 2 ^H 2 0 

5 mM KC1 

5 mM glucose 
pH 7.0, filter sterilize 

6. Spin for 10 minutes in a BecJcman TJ-6 centrifuge at 
about 1000 rpm (500g) . Remove supernatant. 

7 . Gently resuspend pellet in 1.5 ml FW media and plate 
carefully in 35x10 mm petri dishes. 

FW media (recipe for 1 liter) : 



MS salts 4.3 g 

200X B5 vits. 5 ml 

sucrose 30 g 

proline 1.5 g 

mannitol 54 g 

2,4 D 3 mg 



pH 5.7, filter sterilize 

S. Incubate overnight in the dark at room temperature. 
9. Perform GUS assays, insect bioassays, and ELISA's on 
protoplast extracts as described below. 

EXAMPLE 7: CONSTRUCTION OF A FULL- LENGTH SYNTHETIC MAIZE 
OPTIMIZED CRYIA(b) GENE 

Sequence 4 shows the synthetic maize optimized sequence 

encoding the full-length cry LA (b) insecticidal protein from 3. 

thurinaiensis . The truncated version described above 

represents the first approximately 2 Kb of this gene. The 

remainder of the fuli- length gene is cloned using the 
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procedures described above. Briefly, this procedure entails 
"synthesizing DNA oligomers of 40 to 90 NT in length, typically 
using 80 mers as an average size. The oligomers are purified 
using standard procedures of HPLC or recovery from a 
polyacrylaaide gel. Purified oligomers are Jcinased and 
hybridized to form fragments of about 500 bp. The hybridized 
oligomers can be amplified using PCR under standard conditions. 
The 500 bp fragments, either directly from hybridizations, from 
PCS. amplification, or recovered from agarose gels after either 
hybridization or PCS. amplification, are then cloned into a 
plasmid and transformed into ^ coli using standard procedures. 

Recombinant plasmids containing the desired inserts are 
identified, as described above, using PCX and/or standard 
miniscreen procedures. Inserts that appear correct based upon 
their PCS. and/or restriction enzyme profile are then sequenced 
to identify those clones containing the desired open reading 
frame. The fragments are then ligated together with the 
approximately 2 Kb synthetic sequence described in Example 2 to 
produce a full-length maize optimised synthetic crylA(b) 
gene useful for expression of high levels of CrylA(b) protein 
in maize. 

G+C Content of native and synthetic 3t genes: 
Full-length native 38.3% 
Truncated native 37.2% 
Full-length synthetic 64.8% 
Truncated synthetic 54. S% 
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% homology of the final truncated version of the 3t gene 
relative to a "pure* maize codon usage gene: 98.25% 
EXAMPLE 3 Construction of a plant expressible, full-length, 
hybrid partially maize optimized crylA(b) gene. 

PCIB4434 contains a full length CrylA(b) gene 
comprised of about 2 Kb of the synthetic maize optimized 
crylA(b) gene with the remainder (COOH terminal encoding 
portion) of the gene derived from the native gene. Thus, the 
coding region is a chimera between the synthetic gene and the 
native gene, but the resulting protein is identical to the 
native crylA(b) protein. The synthetic region is from 
nucleotide 1-1938 (amino acids 1 to 546) and the native coding, 
sequence is from nucleotide 1939-3468 (amino acids 647 to 
1155) . The sequence of this gene is set forth in Fig. 7. A 
map of pCIB4434 is shown in Fig. 8. 

The following oligos were designed to make pCIB4434: 

KE134A28 « 5' -CGTGACCGAC TACCACATCG ATCAAGTATC CAATTTAGTT 
GAGT-3' 

KE135A28 * 5' -ACTCAACTAA ATTGGATACT TGATCGATGT GGTAGTCGGTC 
ACG-3' 

KE136A28 = 5 ' -GCAGATCTGA GCTCTTAGGT ACCCAATAGC GTAACGT-3' 
KE137A28 = 5* -GCTGATTATG CATCAGCCTAT-3 ' 
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KE138A28 = 5' -GCAGATC7GA GCTCTTATTC CTCCATAAGA AGTAATTC-3 ' 
MK05A28 = 5' -CAAAGGTACC CAATAGC3TA AC3-3' 
MK35A28 = 5' -AACGAGGTGT ACATCGACCG-3' 

PCI34434 is made using a four-way ligarion with a 5.7 
leb fragment: from pCXB4418, a 346 bp 3sr 5 IINKpn 1 
PCR-generated synthetic : native fusion fragment, a 108 bp Kpn 
I\Nsi I native CrylA(b) fragment from pCIB1315, and a 224 bp 
Nsi I\Bgi II PCR-generated fragment. Standard conditions for 
ligation and transformation are as described previously. 

A synthetic: native gene fusion fragment is made in two 
steps using PCS.. The first 253 bp of the PCR fusion fragment 
is made using 100 pmols of oligos KE134A28 and MK04A28 with 
approximately 200 ng of native crylA(b) template in a 100 ul 
volume with 200 nm of each dNTP, 1 X PCX buffer (Perxin Elmer 
Cetus) , 20 % glycerol, and 5 units of Taq polymerase (Perkin 
Elmer Cetus) . The PCS. reaction is run with the following 
parameters: 1 minute at 94 »C, 1 minute at 55 -C, 45 seconds at 
72-C, with extension 3 for 3 seconds for 25 cycles. A fraction 
(1 %) of this first PCR reaction is used as a template along 
with 200 ng of the synthetic crylA(b) DNA to make the complete 
351 bp synthetic: native fusion fragment. Oligos used as PCR 
primers in this second PCS. reaction are 50 pmols of MK35A28, 50 
pmols of MK04A28, and 25 pmols of KE135A28. The PCR reaction 
mix and parameters are the same as those listed above. The 

90 



WO 93/07278 



PCT/US92/08476 



resultant 351 bp synch tic: native fusion fragment is treated 
with Proteinase K at 50 ug\mi total concentration and 
phenol \ chloroform extraction followed by ethanol precipitation 
before cutting with Bst 3 II\Kpn I using standard conditions. 

The 224 bp Nsi I\Bgl II PCS fragment used in maJcing 
pCIB4434 is made using 100 pmois of oligos XE137A28 and 
KE138A28 and 200 ng of the native crylA(b) gene as template in 
100 ul volume with the same PCI reaction mix and parameters as 
listed above. The 230 bp ?CR native crylA(b) fragment is 
treated with Proteinase K, phenol\chlorof orm extracted, and 
ethanol precipitated as described above, before cutting with 
Nsi I\Bgl II. 

pCIB4434 was transformed into maize protoplasts as 
described above. Line 6 2717 protoplasts were used with 
OCIB4434 and pCI34419 as a control for comparison. The results 
are shown below: 

ng 3t/mg protein 
4419 (35S) 14 f 400 ± 2,100 

4434 (full-length) 2,200 ± 900 

Background = 13 ng 3t/mg protein for untransf ormed protoplasts 

The results indicate that ?CIB4434 expresses at a level 
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of about 15% of pCIB4419. 

Western blot analysis snows at least one-tiiird of the 
crylA(b) protein produced by P CIB4434 in this system is about 
130 JcD in size. Therefore, a significant amount of full-length 
crylA(b) protein is produced in maize cells from the expression 
of pCIB4434. 

EXAMPLE 7. Construction of a full- length, crylA(b) genes 
encoding a temperature-stable crylA(b) protein. 

Constructs pCIBS511-5515, each containing a 
full-length, crylA(b) gene are described below. In these 
sequences, the 26 amino acid deletion between amino acids 793 
and 794, KCGEPNRCaPHLEWNPDLDCSCSDGE, present in crylA(a) and 
crylA(c) but not in crylA(b), has been repaired. The gene in 
PCTB5513 is synthetic; the other four genes are hybrids, and 
thus are partially maize optimized. 

Construction of pCIB5511 
This plasmid is a derivative of pCIB4434. A map of 
?CI35511 is shown in Fig. 10. A 435 bp segment of DNA between 
bp 21S5 and 2S90 was constructed by hybridization of synthetic 
oligomers designed to represent the upper and lower strand as 
described above for the construction of the truncated crylA(b) 
gene. This segment of synthetic DNA is synthesized using 
standard techniques known in the art and includes the 26 amino 
acid deletion found to occur naturally in the crylA(b) protein 
in 3acillus thurinciensis Jcurstaki HD-1 . The entire inserted 
segment of DNA uses maize optimized codon preferences to encode 
amino acids. The 26 amino acids used to repair the naturally 
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occurring deletion are contained within this fragment. They are 
inserted starting at position 2387 between the Xpnl site at nt 
2170 and the Xbal site at nt 2508 (2586 in pCIBSSll) of 
PCI34434. pCI35511 is constructed via a three way ligation 
using a 3.2 Kb fragment obtained by restriction digestion of 
pCIB4434 with SphI and Kpnl,. a 3.3 Kb fragment obtained by 
digestion of pCIB4434 with SphI and Xbal, and a 416 bp fragment 
obtained by digestion of the synthetic DNA described above, 
with Kpnl and Xbal. Enzymatic reactions are carried out under 
standard conditions. After ligation, the DNA mixture is 
transformed into competent 3^ coli cells using standard 
procedures. Transf ormants are selected on L-agar containing 100 
lig/ml ampiciilin. Plasmids in transf ormants are characterized 
using standard mini-screen procedures. The sequence of the 
repaired crylA(b) gene encoding the cry LA (b) temperature (heat) 
stable protein is set forth in Fig. 9. 

Construction of ?CIB5512 
This plasmid construct is a derivative of pCIB4434. 
A map of pCI3S512 is shown in Fig. 12. DMA to repair the 26 
amino acid deletion is prepared using standard techniques of 
DNA synthesis and enzymatic reaction. Three double stranded DNA 
cassettes , pGFcasl, pGFcas2 and pGFcas3, each about 300 bp in 
size, are prepared. These cassettes are designed to contain the 
maize optimized codons while maintaining 100% amino acid 
identity with the insecticidai protein. These cassettes are 
used to replace the region between restriction site BstEII at 
position 1824 and Xbal at position 2508 and include the 
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insertion of the additional 78 bp which encode the missing 26 
amino acids (described above for ?C 1353 11 in ?CIB4434) . 3ach of 
these cassettes is cloned into the ScoRV site of the vector 
Bluescript (Stratagene) by standard techniques. The three 
cassettes are designed to contain overlapping restriction 
sites. Cassette 1 has restriction sites SstEII at the 5' end 
and ScoRV at the 3' end: cassette 2 has ScoBV at the 5' end and 
Clal at the 3' end and cassette 3 has Clal at the 5' end and 
Xba I at the 3' end. They are cloned individually in 31uescript 
and the the complete 762 bp fragment is subsequently assembled 
by ligation using standard techniques. pCIBS512 is assembled 
using this 762 bp fragment and ligating it with a 6.65 Kb 
fragment obtained by a complete digestion of p.CIB4434 with 
BstSII and a partial digestion with Xbal. Alternatively, a four 
way ligation using the same vector and the three cassettes 
digested with the specific enzymes can be employed. Enzymatic 
reactions are carried out under standard conditions. After 
ligation, the DNA mixture is transformed into competent S*. coli 
cells using standard procedures. Transformants are selected on 
L-agar containing 100 ug/ral ampicillin. Plasmids in 
transformants are characterized using standard mini-screen 
procedures. The resulting plasmid is ?CI3S512. The sequence 
of the repaired crylA(b) gene is illustrated in Fig. 11. This 
repaired crylA(b) differs from that carried in pCIBS511 in that 
a larger region of the crylA(b) coding region is optimized for 
maize expression by using maize preferred codons. 

Construction of pCI35513 
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This plasmid contains a repaired crylA(b) gene derived 
from pCIBS512. A map of ?CI35513 is shown in Fig. 14. The 
region 3' from the Xbal site at position 2586 to the end of the 
gene (Bglll site at position 3572) is replaced entirely with 
maize optimized codons. This region is synthesized, using 
standard techniques of DNA synthesis and enzymatic reaction, 
well Jcnown in the art, as four double stranded DNA cassettes 
(cassettes # 4,5 ,6 ,7). Adjacent cassettes have overlapping 
restriction sites to facilitate assembly between cassettes. 
These are Xbal and Xhol at the 5' and 3' ends of cassette 4; 
Xhol and SacI at the 5' and 3' ends, respectively, of cassette 
5; SacI and BstXI at the 5' and 3' ends, respectively, of 
cassette 6; and BstXI and Bglll at the 5' and 3' ends, 
respectively, of cassette 7. As described for ?CIB5512, the 
cassettes are cloned into the blunt-end ScoKV site of the 
Bluescript vector (Stratagene) and the full-length "repaired" 
crylA(b) gene cloned either by sequential assembly of the above 
cassettes in Bluescript followed by ligation of the complete 
967 bp synthetic region with a 5448 bp fragment obtained by a 
complete digestion of pCI35512 with Bglll and a partial 
digestion with Xbal. Alternately, the plasmid containing the 
full-length genes is obtained by a 5-way ligation of each of 
the four cassettes (after cleavage with the appropriate 
enzymes) and the same vector as above. The sequence of the 
full-length, "repaired" crylA(b) gene is set forth in Fig. 13. 
The protein encoded by the various synthetic and 
synthetic/native coding region chimeras encode the same 
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protein. This protein is the heat-stable version of crylA(b) 
produced by repairing the naturally occurring 26 amino acid 
deletion found in the crylA(b) gene from aacillus thurinaiensis 
icurstaki HD-1 when the homologous region is compared with 
either crylA(a) or crylA(c) Bacillus thurinoiensis 
delta-endotoxins * 

Construction of ?CIBS514 
This plasmid is a derivative of pCIB4434. .A map of 
pCIB5514 is shown in Fig- IS. It is made using synthetic DNA 
cassette #3 (see above) which contains a maize optimized 
sequence of the region between the Clal sits (position 2396) 
found in the 26 amino acid thermostable region and the Xbal 
site at position 2508 in pCI34434 (2586 in pCI35511) . The 
region between at 2113 of ?CI34434 and the junction of the 
thermostable region is ?CR amplified by using ?CIB4434 as 
template with the following primers: 

forward: 5' GCACCGaiATCACZATCCAAGGAGGCGATGaCGTATTCAAAG-3' 
reverse: 

5' -AGCGCATCGAMCSGCTCCCCGCACTTC • 
The PCS. product is then digested with restriction 
enzymes Kpnl and Clal and ligated in a four part reaction with 
a 189 bp fragment obtained by digestion of cassette 3 with Clal 
and Xbal, a 3.2 Kb fragment of pCI34434 digested with SphI and 
Kpnl, and a 3.8 Kb fragment of pCI34434 obtained by digestion 
with SphI and Xba. Enzymatic reactions are carried out under 
standard conditions. The ligation product is transformed into 
competent E coli cells, selected with ampicillin and screened 
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using standard procedures described above. Th sequence of the 
repaired crylA(b) gene contained in pCXB5514 is shown in Fig. 
15 . 

Construction of pCIBSSIS 
pCIB4434 was modified by adding the 78bp Geiser 
thermostable element (Geiser TSE) , described above, between 
the Kpn I site (2170 bp) and the Xba I site (2S08 bp) in the 
native 3tk region. The exact insertion site starts at the 
nucleotide #2379. The region containing the Geiser TSE was 
amplified by two sets of PCR reactions, i.e. the Kpn I - Geiser 
TSE fragment and the Geiser TSE - Xba I fragment. 

PCR p rimer* 1: (Kpn I site) 

5' - ATTACGTTAC GCTATTGGGT ACCTTTGAXG - 3' 

PCR primer! 2: (Geiser TSE bottom) 

5' - TCCCCG7CCC TGCAGCTGCA GTCTAGGTCC GGGTTCCACT 
CCAGGTGCGG AGCSCATCGA TTCGGCTCCC CGCACTTGCC 
GATTGGACTT GGGGCTGA - 3' 

PCR primer#3: (Geiser TSE top) 

5/ - CAAGTGCGGG GAGCCGAATC GATGCGCTCC GCACCTGGAG 
TGGAACCCGG ACCTAGACTG CAGCTGCAGG GACGGGGAAA 
AATGTGCCCA TCATTCCC - 3' 

PCR primer*4: (Xba I site) 

5' - TGGTTTCTCT TCGAGAAATT CTAGATTTCC - 3' 
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After the amplification, the ?CR fragments were 
digested witii (Kpn I +■ Cla I) and (Cla I + Xba I) , 
respectively. These two fragments were ligated to the Kpn I 
and Xba I digested pCIB4434 . The resulting construct pCIBSSIS 
is pCIB4434 with a Geiser ?SE and an extra Cla I site flanked 
by Kpn I and Xba I. A map of pCI35515 is illustrated in Fig. 
38. The crylA(b) gene contained herein, which encodes a 
temperature stable cryLA(b) protein, is shown in Fig. 37. 

Examples 9-20 set forth below are directed to the 
isolation and characterization of a pith-preferred promoter. 
example 9. SNA isolation and Northern Blots 

All SNA was isolated from plants grown under greenhouse 
conditions. Total SNA was isolated as described in Kramer et 
al..' Plant Phvsiol., 90:1214-1220 (i990) from the following 
tissues of Funk maize line 5N984: 8, 11, IS, 25, 35, 40, and 60 
day old green leaves; 8, 11, 15, 25, 35, 39, 46, 60 and 70 day 
old pith; 60 and 70 day old brace roots from Funk maize line 
5N984; 60 and 70 day 5N984 sheath and ear stock. RHA was also 
isolated from 14 day 211D roots and from developing seed at 
weekly intervals for weeks one through five ?ost-pollenation. 
Poly A+ SNA was isolated using oligo-d? as described by 
Sambrook et al.. Molecular Cl oning: A Laboratory Manual (2nd 
ed.), 1989, and Northern blots were carried out, also as per 
Sambrook et al. using either total RNA (30 ug) or poly A+ RNA 
(2-10 ug) . After electrophoresis, RNA was blotted onto 
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Nitroplus 2000 membranes (Micron Separations Inc) . The SNA was 
linked to the filter using the St ratal inker (Stratagene) at 0.2 
mJoules. The northerns were probed with the 1200 hp ScoRI pith 
(TRpA) 8-2 cDNA fragment, isolated by using 0.3% low melting 
temperature agarose in a T3E buffer system. Northerns were 
hybridized and washed and the filters exposed to film as 
described in Isolation of cDNA clones. 

EXAMPLE 10. Isolation of cDNA Clones 

First strand cDNA synthesis was carried out using the 
3RL AMV reverse transcriptase system I using conditions 
specified by the supplier (Life Technologies, Inc., 
Gaithersburg, MD) „ Specifically, 25 ul reactions containing 50 
mM Tris-HCl pH 3.3, 20 mM KC1, 1 mM DTT, 6 mM MgC12, 1 mtf each 
of each dNTP, 0.1 mM oligo <dT) 12-18, 2 ug pith poly<A+) SNA* 
100 ug/ml BSA, 50 ug/ml actinomycin 0 , 8 units placental RNase 
inhibitor, 1 ul (10 mM Ci/ml) 32P dCTP >3000 mCi/raM as tracer, 
and 30 units AMV reverse transcriptase were incubated at 42 # C 
for 30 min. Additional KC1 was added to a concentration of 50 
mM and incubation continued a further 30 min. at 42 *C. KC1 was 
added again to yield a final concentration of 100 mM. 
Additional AMV reverse transcriptase reaction buffer was added 
to maintain starting concentrations of the other components 
plus an additional 10 units, and the incubation continued at 
42 # C for another 30 min. Second strand synthesis was completed 
using the Riboclone cDNA synthesis system with Eco RI linkers 
(Promega, Madison, Wi) . Double stranded cDNA was sized on an 
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1% agarose gel using Tris-borate-SDTA buffer as disclosed in 
Sambrook et al. ( and showed an average size of abour 1.2 Kb. 
The cDNA was size fractionated using NA45 DEAE membrane so as 
to retain those molecules of about 1000 bp or larger using 
conditions specified by the supplier (Schleicher and Schuell) . 
Size fractionated cDNA was ligated into the Lambda ZapII vector 
(Stratagene, La Jolla, GA) and packaged into lambda particles 
using GigapacJc II Plus (Stratagene, La Jolla, CA) . The 
unamplified library had a titer of 315,000 pfu while the 
amplified library had a titer of 3.5 billion/ml using SIM' 
cells . 

Recombinant phage were plated at a density of 5000 pfu 
on 150 X 15mm L-agar plates. A total of 50,000 phage were 
screened using duplicate lifts from each plate and probes of 
first strand cDNA generated from either pith derived mRNA or 
seed derived aRNA. The lifts were done as described in 
SambrooJc et al. using nitrocellulose filters. DNA was fixed to 
the filters by OV crosslinlcing using a Stratalinker 
(Stratagene, La Jolla, CA) at 0.2 mJoule. Prehybridization and 
hybridization of the filter were carried out in a solution of 
10X Denhardts solution, 150 ug/ml sheared salmon sperm DNA, 1% 
SDS, SO mM sodium phosphate pH 7, 5 mM EDTA, SX SSC, 0.05% 
sodium pyrophosphate. Prehybridization was at 62-C for 4 hours 
and hybridization was at 62 -C for 18 hours (overnight) with 1 
million cpm/ml in a volume of 40 ml. Filters were washed in 
500 ml of 2X SSC, 0.5% SDS at room temperature for 15 ain. then 
at S3-C in 0.1X SSC, 0.5% SDS for 30 min. for each wash. 
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Radiolabeled DNA probes were made using a 3RL random prime 
labeling system and unincorporated counts removed using Nick 
Columns (Pharmacia) . Filters were exposed overnight to Kodak 
X-Omat AR X-ray film with (DuPont) Cronex Lightning Plus 
intensifying screens at -30 # C. Plaques showing hybridization 
with the pith-derived probe and not the seed-derived probe were 
plaque purified for further characterization. 

EXAMPLE 11. Isolation of Genomic Clones 

Genomic DNA from Funk inbred maize line 211D was 
isolated as described by Shure et al., Call , 35:225-233 (1988). 
The DNA was partially digested with Sau 3A and subsequently 
size fractionated on 10-40% sucrose gradients centrifuged in a 
Beckman SW40 rotor at 22, 000 rpm for 20 hours at 20»C. 
Fractions in the range of 9-23 Kb were pooled and ethanol 
precipitated. Lambda Dash II (Stratagene) cut with Bam HI was 
used as described by the supplier. The library was screened 
un amplified and a total of 300, 000 pfu were screened using the 
conditions described above . The library was probed using pith- 
specific (TrpA) cDNA clone 3-2, pCI35600 which was identified 
in the differential screen of the cDNA library. Isolated 
clones were plaque purified and a large scale phage preparation 
was made using Lambdasorb (Promega) as described by the 
supplier. Isolated genomic clones were digested with Sco RI 
and the 4.8 kb ScoRI fragment was subcioned into Bluescript 
vector (Stratagene) . 
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EXAMPLE 12. DNA Sequence and Computer Analysis 

Nucleotide sequencing was performed using the dideoxy 
chain-termination method disclosed in Sanger et al., 
PNAS, 74:5463-5467 (1977). Sequencing primers were synthesized 
on an Applied 3iosystems model 380B DNA synthesizer using 
standard conditions. Sequencing reactions were carried out 
using the Sequenase system (US Biochemical Corp . ) . Gel 
analysis was performed on 40 cm gels of 6% polyacrylamide with 
7 M urea in Tris-Borate-SDTA buffer (BRL Gel-Mix 6) . Analysis 
of sequences and comparison with sequences in GenBanlc were done 
using the a. of Wisconsin Genetic Computer Group Sequence 
Analysis Software (UWGCG) . 

sxamplb 13. Mapping the Transcriptional Start Site 

Primer extension was carried according to the procedure 
of Metraux et al., PNAS , 86: 396-900 (1988). Briefly, 30 ug of 
maize pith total RNA. were annealed with the primer in SO mM 
Tris ?H 7.S, 40 mM XC1, 3 mM MgC12 (RT buffer) by heating to 
80 »C for 10 minutes and slow cooling to 42'C. The 3NA/primer 
mix was allowed to hybridize overnight. Additional RT buffer, 
DTT to 6 mM, 3SA to 0.1 mg/ml, SNAsin at 4 O/ml and dNTP' s at 1 
mM each were added. Then 8 units AMV reverse transcriptase were 
added and reaction placed at 37 «C for one hour. The primer used 
was 5' -CCGTTCGTTC CTCCTTCGTC GAGG-3', which Starts at +90 bp 
relative to the transcription start. See Fig. 29A. A 
sequencing ladder using the same primer as in the primer 
extension reaction was generated using the 4.8 Kb genomic clone 
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to allow determination of the transcriptional start site. The 
sequencing reaction was carried out as described in Example 12. 

RNase protection was used to determine if the the 371 
bp sequence from +2 bp to +373 bp (start of" cDNA) was 
contiguous or if it contained one or more introns. A 385 bp 
Sphl-Ncol fragment spanning +2 bp to +387 bp relative to 
transcriptional start see ?ig. 29B was cloned into pGEM-5Zf (+) 
(Promega) and transcribed using the Riboprobe Gemini system 
(Promega) from the SP6 promoter to generate radioactive 
antisense RNA probes as described by the supplier. RNase 
protection was carried out as described in SambrooJc et al.. 
pBR322 (cut with Hpall and end labelled with 32P-dCTP) and 
Klenow fragment were used molecular weight markers. Gels were 
S% acrylamide/7M urea (BRL Gel-Mix 6) and were run at 60 watts 
constant power. 

5XAMPIS 14. Genomic Southern Blots 

Genomic DMA was isolated from aai2e line 21 ID using the 
procedure of Shure et al. f supra. 9 ug of genomic DNA were 
used for each restriction enzyme digest. The following enzymes 
were used in the buffer suggested by the supplier: 3amHI, 
BcoRI, EcoflV, Hindlll, and Sad. Pith cDNA clone number 8-2 
was used for estimating gene copy number. The digested DNA was 
run on a 0.7% agarose gel using Tris-Borate-SDTA buffer system. 
The gel was pretreated with 250 mM HC1 for IS min. to 
facilitate transfer of high molecular weight DNA. The DNA was 
transferred to Nitroplus 2000 membrane and subsequently probed 
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with the Pith cDNA 8-2. The blot was washed as described 
in Example 10. 

ra&MPLS 15. PCS. Material and Methods 

PCS. reactions were preformed using the GeneAmp DNA 
Amplication reagent kit and AmpliTaq recombinant Taq DMA 
oolaerase (Perlcin Slaer Cetus) . Reaction condition were as 
follows: 0.1 to 0.5 uM of each of the two primers used per 
reaction, 25 ng of the pith 4.8 Kb EcoRI fragment in 
Bluescript, plus the PCS reaction mix described by the supplier 
for a total volume of 50 uL in 0.5 mL GeneAmp reaction tube 
(Berlin Slmer Cetus) . The DNA Thermal Cycler (Perlcin Slaer 
Cetus) using the Step-Cycle program set to denature at 94 *C for 
60 s, anneal at 55»C for 60 s f and extend at 72-C for 45 s 
followed by a 3-s-per-cycle extension for a total of 30 cycles. 
The following primer sets were used: I. 83 X 84, -429 bp to -2 
bp; II. 49 x 73, -69 bp to +91 bp; III. 38 X 41, +136 bp to 
+258 bp; and IV. 40 X 75, +239 bp to +372 bp. These are marked 
on Fig. 24. 

BXAMPLS 16. Isolation of a Pith-Preferred Gene. 

A cONA library derived from pith mRNA cloned into 
Lambda Zap and screened using first strand cDHA derived from 
either pith or seed mRHA. Clones which hybridized with only 
the pith probe were plaque purified and again screened. Clones 
passing the second screen were used as probes in northern blots 
containing RNA from various maize tissues i 
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2XAMFLS 17. Gene Structure and Sequence Analysis. 

The 1.2 Kb insert of the cDNA clone 8-2 was sequenced 
using the dideoxy method of Sanger et al., supra. Likewise, 
the genomic equivalenr contained on a 4 . 3 Kb ScoRI fragment in 
Bluescript denoted as ?CI35601, was sequenced. This 
information revealed that the genomic copy of the coding region 
spans 1.7 Kb and contains five introns . The mRNA transcript 
represents six exons. This is shown in Fig. 24. The exons 
range in size from 43 bp to 313 bp and the introns vary in size 
from 76 bp to 130 bp. The entire sequence of the gene and its 
corresponding deduced amino acid sequence are shown in Fig. 24. 

This gene encodes a protein of 346 amino acids with a 
molecular mass of about 38 JcD. As illustrated in Table 1, the 
predicted protein shows 52% similarity and 41% identity with 
the subunit protein of Pseudomonas aeruginosa and has high 
homology with trpA proteins from other organisms. 

Table 1 

Conservation of TrpA sequences between a maize TrpA gene and 
other organisms. 



Organisms % amino acid % amino acid 

compared Similarity Identity 



Haloferax volancii 56.4 36.1 

Methanococcus voltae 53.1 35.1 

Pseudomonas aeruginosa 62.5 41.8 

Neurospora crassa 61.4 39.3 

Saccharomycas cerevisiae 56.7 36.1 



WO 93/07278 



PCT/US92/08476 



Similarity"groupings ^ t= L = M =V, D=E, F-Y, K=R, N-Q, S=T 

Similarities and indentities were done using the GAP program 
from DWGC3- , . - nn 

Crawford et al., Ann. Rev. Microoxol. . 43:567-o00 

(1989) , incorporated herein by reference, found regions of 

conserved amino acids in bacterial trpA genes. These are amino 

acids 49 to 58 , amino acids 181 to 184, and amino acids 213 to 

215, with the rest of the gene showing greater variability than 

is seen in the TrpB sequence. An alignment of known trpA 

proteins with the maize TrpA protein (not shown) illustrates 

that the homology between the maize gene and other trpA 

proteins is considerable. Also, it is comparable to the level 

of homology observed when other TrpA proteins are compared to 

each, other as described in Crawford et al., supra. 

To determine the location of the transcription start 

site and whether or not there were introns present in this 

region, four polymerase chain reacrion (PCS.) generated 

fragments of about 122 bp to 427 bp from tiie region -429 bp to 

+372 bp were used for northern analysis. The results of the 

northerns snowed that PO probes II, III, IV hybridized to pith 

total SNA and ?CR probe I did not hybridize. This indicated 

that the transcription start was in the -69 bp to +90 bp 

region. To more precisely locate the transcriptional start 

site, primer extension was employed. Fig. 29A shows that when 

a primer (#73) located at +90 bp relative to the 

transcriptional start is used for primer extension, the 

transcriptional start site is located at +1, 1726 bp on the 



106 



WO 93/07278 



PCT/US92/08476 



genomic sequence. 

The first ATG from the transcriptional start site is at 
•114 bp* This is the ATG that would be expected to serve as 
the site for translational initiation. This ATG begins an open 
reading that runs into the open reading frame found in the cDMA 
clone. The first SO amino acids of this predicted open reading 
frame strongly resemble a chloroplast transit peptide. See 
Berlyn et al. ?NAS, 86:4604-4608 (1989) and Neumann-Karlin et 
al., SMBO J. , 5:9-13 (1986). This result suggests that this 
protein is targeted to a plastid and is likely processed to 
yield the active protein. Transient expression assays in a 
maize mesophyll protoplast system using a maize optimized B.t. 
gene driven by the trpA promoter showed that when the ATG at 
+114 bp is used as the fusion point, the highest levels of 
expression are obtained* Using either of the next two AXGs in 
the sequence substantially reduces the level of expression of 
the reporter gene. The ATG at +390 bp gave some activity, but 
at a much lower level than the +114 ATG f and the ATG at +201 bp 
gave no activity. 

Athough a number of TATA like boxes are located 
upstream of the upstream of the transcriptional start site at 
+1 bp r the TATAAT at -132 bp is most like the plant consensus 
of TATAAA. See Joshi, Nuc . Acids Res. , 15:6643-6653 (1987). 
The presumptive CCAAT like box was found at -231 bp . The 
nucleotide sequence surrounding the ATG start (GCGACATGGC) has 
homology to other maize translation starts as described in 
Messing et al.. Genetic Engineering of Plants: An Agricultural 
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-5-1 1 -??7 ft oaSl • but differs from 
gerspeetive , Plenum Press, pp. 211-2-2/ u-oj; , 

that considered, a consensus sequence in plants (ANNATGGO . 
See, joshi, above. The presumptive poly (A) addition signal is 
located at 3719 bp (AATAAA) on the genomic sequence, 52 bp from 
the end of the cDNA. The sequence matches known sequences for 
maize as described in Dean et al., Nuc. Acids Res., 
14:2229-2240 (1986), and is located 346 bp downstream from the 
end of protein translation. See Dean et al., Hue. Acids Res. , 
14:2229-2240 (1986). The 3' untranslated sequence of the cDNA 
ends at 3775 bp on the genomic sequence. 

Fig. 28 shows a Southern blot of maize 211D genomic DKA 
with the approximate gene copy number as reconstructed using 
pith gene 8-2 cDNA. From the restriction digests and 
reconstruction there appear to be 1-2 copies of the gene 
present per haploid genome. There do not appear to be other 
genes with lower levels of homology with this gene.. Therefore, 
zh±s represents a unique or small member gene family in maize. 

SXAMPLS 18. RNase Protection 

The structure of the 5' end of the mRNA was determined 
using RNase protection. The RNase protection was carried out 
using a probe representing 385 nt from +2 bp to +387 bp. This 
region from the genomic clone was placed in the RNA 
transcription vector pGEM-5Zf (+> and a 32P labelled RNA probe 
generated using S?6 polymerase. The probe and the extra bases 
from the multiple cloning site produce a transcript of 461 nt. 
The probe was hybridized with total pith RNA and subsequently 
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digested with a mixture of RNase A and Tl and til protected 
fragments analyzed on denaturing polyacrylami.de gels. Analysis 
of the gels shows a protected fragment of about 255 nt and 
another fragment of about 160 nt. See ?ig. 29B. 

The fact that primer extension using a primer <#73) at 
+80 bp produces a product of 90 NT in length argues that the 
5' end of the transcript is located at position +1 bp. Primer 
extension from a primer in this region produces a product, so 
one would expect this also to be detected by the RNase 
protection assay. This primer is located in the 5' region of 
the RNase protection probe. The cDNA clone contains sequences 
present in the 3' end of the RNase protection probe and hence 
were expected to be protected in this assay. Since only one 
band is present on the gel which could account for both of 
these sequences, we are confident that the protected fragment 
is indeed the larger band and that the smaller single band is 
an artifact. If there were an intron in this region, fragments 
from each end would be present in the probe, and hence would be 
detectable on the gel. Of the two bands seen, one of them 
appears to represent the entire 5' region, therefore we do not 
believe that there is an intron located in this region. 

SXAMPLS 19. Complementation of coli TrpA Mutant with the 
Pith cDNA 8-2 

5. coli strain CGSC strain 5531 from the E. coli 
Genetic Stock Center, Yale University <0.H. Smith lab strain 
designation, #M5004) with chromosomal markers glnA3, TrpA982S, 
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1-, iN(rmD-rmE) , thi-i as described in Mayer at ai. , Mol. Gen. 
Gentec , 137:131-142 (1975), was transformed with either the 
pith (TRpA) cDNA 3-2 or 31uescript plasmid (Stratagene) as 
described in Sambrook et al., supra. The transfonnants 
containing the TrpA cDNA 8-2 had the ability to grow without 
the presence of tryptophan on minimal medium whereas the 
transfonnants with the Bluescript (Stratagene) plasmid or 
untransformed control were not able to grow without tryptophan. 
The cells transformed with the maize TrpA gene grew very slowly 
with colonies visible after seven days growth at room 
temperature. All strains were grown on M9 minimal medium 
supplemented with 200 ug/ml glutamine, 0.01 ug/ml thiamine and 
with or without 20 ug/ml tryptophan. All transfonnants were 
checked for the presence of the appropriate plasmid by 
restriction enzyme analysis. Colonies growing in the absence 
of tryptophan all contained clone 8-2 containing the cDNA for 
the putative maize TrpA gene, as confirmed by southern 
hybridization (data not shown) . These results support the 
conclusion that this is the maize tryptophan synthase subunit A 
protein. 

3XAMP1S 20. Gene Expression 

The expression pattern of the pith-preferential gene 
throughout the plant. was examined. Different maize genotypes 
were also examined for patterns of expression of this gene. 
The following tissues were used as the source of RNA for these 
studies: upper, middle, and lower pith, brace roots, ear 
shank, cob in genotype 5N984; upper, middle, lower pith, 10 day 
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old leaves r 14 day old roots and pith from the entire plant in 
genotype 211D, and seed from genotype 211D which had been 
harvested at weekly intervals one to five weeks 
post -pollination. Lower pith is derived from, i.e. constitutes 
the two internodes above brace roots; middle pith is derived 
from the next three internodes; upper pith represents the last 
rwo internodes before the tassel in SO and 70 day plants. Only 
two internodes were present in 39 day old plants and three 
internodes for 46 day old plants. Northern blot analysis shows 
that transcripts hybridizing with a probe derived from the pith 
cDNA accumulate rapidly in young pith and young leaf. As the 
age of the plant increases and one moves up the stalk , there is 
a significant decrease in the amount of transcript detected. 
See Figs. 25A-D. At no time is message from this gene detected 
in seed derived RNA, either total RNA or poly A+ UNA. See Fig. 
26. Transcript is also detected in root, earshank, and sheath 
but not at the high levels detected in the pith and young leaf 
tissues. See Figs. 25B, 25C. Some message is detected in 
brace roots, but only at a very low level. See Fig. 25D. Six 
maize undifferentiated callus lines were analyzed by northern 
blot analysis and no expression was found for this gene (data 
not shown) in any callus sample. The level of expression of 
this gene is extremely high since a very strong signal to a 
probe from TrpA gene 8-2 can be detected in pith and leaf as 
little as two hours after exposure of the blot to film (Fig. 
2SA) . The amount of mRNA made is comparable to that derived 
from the maize phosphoenolpyruvate carboxylase gene disclosed 
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in Hudsperh et 1., t,i ,nr m p 1. aioloov, 12:579-589 (1989), 
another highly expressed maize gene. Hudspeth is incorporated 

herein by reference. 

The expression pattern of this gene is not temporally 
constant. Expression is very high in the lower and middle pith 
of plants less than 60 days old and decreases rapidly near the 
top of the plant. As the plant reaches maturity, e.g. over 70 
days old, the expression drops to nearly undetectable levels 
except in the lower pith and earsfaank. The accumulation of 
transcript in young leaf is nearly as high as that seen in 
lower pith but expression decreases rapidly and is undetectable 
in leaves over 40 days of age. Expression in leaf was found to 
be variable depending on the season when it is grown. 

Examples 21-39 set forth below are directed to the 
isolation, characterization and expression analysis of a 
pollen-specific promoter according to the present invention. 

Identification of pollen-specific proteins 
gvatmie 21. Maize Plant Growth 

Maize plants (Zea mays Funk inbred 211D) were grown 
from seed in a vermiculite/sand mixture in a greenhouse under a 
16 hour light/ 8 hour dark regime. 
Example 22. Total Pollen Protein Isolation 

Mature pollen was isolated from maize plants at the 
time of maximum pollen shed. It was sieved to remove debris, 
frozen in liquid nitrogen, and a 3-4 ml volume of frozen pollen 
was ground in a mortar and pestle with an equal volume of 
75-150 um glass beads. 40 ml of grinding buffer (2mM EDTA, 5mM 
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DTT, 0,1% SDS, 100 mM Hepes ?H 8) was added and the mixture was 
ground again. The glass beads and intact pollen grains were 
pelleted by low speed centrifugation, and mixture was clarified 
by centrifugation at 10,000 g for 15 minutes. Protein was 
precipitated from the supernatant by addition of acetone to 
90%. 

Ex a mp le 23. Pollen Sxine Protein Isolation 

Exine Protein was isolated from maize 211D shed pollen 
as described in Matousek and Tupy, J., Plant Physiology 
119:169-178 (1985). 
Example 24. Leaf Protein Isolation 

Young leaves (about 60% expanded) were cut from the 
maize plant the midrib removed. Total protein was isolated as 
for pollen f except that the material was not frozen and 
grinding was in a Waring blender without glass beads. 
Example 25. Xernel Protein Isolation 

Ears with fully developed, but still moist kernels were 
removed from the plant and the kernels cut off with a scalpel. 
Total protein was isolated as for leaves. 
Example 26. Gel Electrophoresis of Maize Proteins 

Pollen, leaf and kernel proteins were separated on SDS 
polyacrylamide gels as described in Sambrook et al, Molecular 
Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory 
Press: New York (1989) . Following staining by Coomasie blue, 
protein bands from pollen, leaf and kernel were compared and 
abundant proteins of approximately 10 kD,13 kD, 20 kD, 45 kD, 
55 kD and 57 kD were determined to be pollen specific. 
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identification of Pollen-Specif ic cDNA clon s 
Sxample 27. Partial Sequence D termination of Pollen-Specific 

Proteins 

Protein bands determined to be pollen-specific were 
purified by electroblotting from the polyacrylamide gel onto 
PVDF membrane (Matsudaira, P., -T. aim. Chem. 261:10035-10038 
(1987) ) or by reverse phase HPLC. N-terminal sequence of the 
purified proteins was determined by automated Edman egradation 
with an Applied Biosystems 470A gas-phase sequencer. 
Phenylthiohydantoin (PTH) amino acids were identified using an 
Applied Biosystems 12 OA PTH analyzer. To obtain internal 
sequence, proteins were digested with endoproteinase Lys-C 
(Boehringer Mannheim) in 0.1 M Tris-HCI, ?H 8.5, for 24 hours 
at room temperature using an enzyme: substrate ratio of 1:10. 
Resulting peptides were isolated by HPLC using an Aquapore C-8 
column eluted with a linear acetonitrile/isopropanol (1:1 
ratio) gradient (0 to 60%) in 0.1% TFA. Sequence of isolated 
Lys-C peptides was determined as above. The following 
sequences were determined for the 13kD pollen-specific protein: 

N-terainus: TTPLTFQVGKGSKPGHLILTPNVATI 
LysC 61: KPGHLILTPNVATISDWIK 
LysC 54: SGGTRIADDVTPADFK 
LysC 49: BHSGDDFSFTLK 
LysC 43: EGPTGTWTLDTX 

Example 28. Synthesis of Oligonucleotide Probes for 
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Pollen-Specific cONAs 

Regions of pepride sequence in the 13kD protein with 
low co don redundancy were selected, and suitable 
oligonucleotide probes for the gene encoding these regions were 
synthesized on an Applied aiosystems 380A synthesizer. The 
following oligonucleotides were synthesized: 

Oligo #51 5'-AAATC ATC ACC ACC ATG TTC-3' 

G G G G C 

T 
C 

Oligo # 58 5'-CC TTT ACC CAC TTG AAA-3' 

C G C G 

T 
r 

where the columns of nucleotides represent bases that were 
incorporated randomly in equal proportions at the indicated 
position in the oligo. Oligo #51 encodes the amino acid 
sequence EHGGDDF found in peptide LysC 49, and Oligo #58 
encodes the amino acid sequence FQVGKG found in peptide 
N- terminus. Use of these mixed oligonucleotides to screen a 
cDNA library for the pollen-specific gene, N will be described 
below. 

Example 29. Construction of a maize pollen cDNA library 
Total maize SNA from maize 211D shed pollen was 
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isolated as described in Glisen et al, Biochemistry 
13:2632-2637 (1974) . Poly A+ mRNA was purified from total SNA 
as described in SambrooJc et al. Using this mRNA, cDNA was 
prepared using a cDNA synthesis Jcit purchased from Promega, 
following protocols supplied with the Jcit. The ScoRI linkers 
were added to the cDNA and it was ligated into arms of, the 
cloning vector lambda Zap, purchased from Stratagene and using 
the protocol supplied by the manufacturer. The ligation 
product was packaged in a lambda packaging extract also 
purchased from Stratagene, and used to infect 3^. coli 3B4 
cells . 

Example 30. Isolation of pollen-specific cDNA clones 

The maize pollen cDNA library was probed using the 
synthetic oligonucleotides probes specific for the 13kD protein 
gene, as described in Sambrook et al. Briefly, about 100,000 
phage plaques of the pollen cDNA library were plated and lifted 
to nitrocellulose filters. The filters were probed using 
oligonucleotides #51 and #58 which had been 32P end-labeled 
using polynucleotide kinase. The probes were hybridised to 
the filters at low stringency (50 degrees C in 1M NaCi, 10% 
dextran sulfate, 0.5% SDS) , washed 30 minutes at room 
temperature and then 30 minutes at 45 degrees C in SX SSC, 0.1% 
SDS, and exposed to X-ray film to identify positive clones. 
Putative clones were purified through four rounds of plaque 
hybridization. Three classes of cDNA clones were isolated. 
Type I contained ScoRI fragments of. 0.2 kb and 1.8 kb. Type II 
contained ScoRI fragments of 0.6 kb, 0.5 kb and 1.0 kb, and 
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Type III contain d an ScoRI fragment of 2.2 leb, 

Sxampl 31. Characterization of Pollen-specific cDNA clones 

The ScoRI fragments of the Type II cDNA clone were 
s cloned into the plasmid vector pBluescript SK+, purchased 
from Stratagene. See Fig. 30. The 0.5 kb fragment in 
pBluescript was named I2-.S, the 0.5 kb fragment in pBluescript 
was named II-. 5 (later renamed pC 1331 69) and the 1.0 kb 
fragment in pBluescript was named II-1.0 (later renamed 
pCIB3168) . As will be described below, the 0.5 kb and 1.0 kb 
fragments encode the maize pollen- specif ic CDPK gene. RNA from 
anthers, pollen, leaf, root and silk was denatured with 
glyoxai, electrophoresed on a 1% agarose gel, transferred to 
nitrocellulose, and probed separately with the three ScoRI 
fragments that had been labeled with 32P by random primer 
extension as described in Sambrook et al, Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Laboratory Press: New 
York (1989) . The blots were exposed to X-ray film, and an. mRNA 
band of approximately 1.5 kb was identified with the 0.6 kb 
fragment probe, while the 0.5 and 1.0 kb fragments hybridized 
to an approximately 2.0 kb mRNA. In all cases hybridization 
was only seen in the pollen RNA lane, with the exception that 
the 0.5 kb fragment showed a slight signal in anther mRNA. The 
conclusion from these data was that the original cONA clone was 
a fusion cDNA molecules derived from two different mRNAs. The 
0.5 kb fragment was a partial cDNA of a 1.5 kb pollen-specific 
mRNA, and this mRNA encodes the peptides LysC 49 and 
N-terminus. The 1.0 and 0.5 kb fragments comprise a partial 
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cOA of a 2.0 kb pollen-specific mRNA unrelated to the peptides 
and oligonucleotide probes used for probes. This conclusion was 
verified when the fragments were sequenced using the dideoxy 
chain termination method as described in SambrooJc ec al. Tae 
cDNA sequence is shown in Fig. 31. 

Sxample 32. Determination of specificity of mRNA expression 

To determine if the 2.0 kb RNA represented by cDNA 
clones pCIB3169 and P CIB31S8 were present only in pollen, total 
SNA was isolated from maize 211D roots, leaves, pollen, anthers 
or silks. The BNAs were denatured with glyoxal, 
electrophoresed on a 1% agarose gel, transferred to 
nitrocellulose, and probed with 32P-labeled ScoRI insert from 
plasmid OCI331S3 or ?CIB3169, all using standard techniques as 
described in Sambrook et al, Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Laboratory Press: New York (1989) . 
Sxposure of this blot to photographic film demonstrates that 

the gene represented by these two clones is only 

transcriptionally active in the pollen (Fig. 32) . 

Identification of a Pollen-Specific Promoter 

Sxample 33. Construction of a Maize Genomic DNA Library 
Genomic DMA from maize line 211D young shoots was 

isolated as described in Shure et 1, Call 35:225-233 (1983) . 

The DNA was provided to Stratagene, where a genomic DNA library 

was constructed by cloning Sau3AI partially digested DNA into 

Stratagene' s Lambda Dash cloning vector. 

Sxample 34 . Genomic DNA Blot Hybridization to Determine Gene 
Copy' Number . 
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Genomic DNA from maize line 211D was digested with a 
number of restriction enzymes, the individual digests 
electrophoresed on an agarose gel, transferred to 
nitrocellulose, and probed with 32P- labeled ZcoRI insert from 
plasmid ?CI33168 (1.0 kb fragment), ?CI33169 (0.5 kb fragment) 
or clone II-. S using standard techniques described in Sambrook 
et al. More than 10 bands were detected by the II- -6 probe on 
most digests, indicating that this cDNA is derived from a 
large, multigene family. Probing with the 1.0 kb fragment 
detected from 3 to 6 bands, and probing with the 0.5 kb 
fragment detected only from 1 to 3 bands which were a subset of 
those detected by the 1.0 kb fragment. Due to the smaller gene 
family size detected by the 1.0 kb and 0.5 kb fragments, it was 
decided to attempt to isolate the genomic clone corresponding 
to them. 

Example 35. Isolation of a pollen-specific genomic clone 

The Stratagene maize 211D genomic library was screened 
by probing plaque lifts with 32P labeled inserts from plasmid 
PCIH3168 (1.0 kb fragment) and pCI33169 (0.5 kb fragment) using 
standard procedures as described in the Stratagene manual 
accompanying the library. Using this strategy, Lambda clone 
MG14 was isolated, and it hybridized to both probes. The 9.0 
kb 3amHI fragment of MG14, which also hybridized to both 
probes, was subcioned into the 3amHI site of pBluescript SK+ to 
create plasmid pCI3379. 1800 bp of pCI3379, in the region 
corresponding to the cDNA sequence, was sequenced as described 
above. Comparison of the cDNA and genomic sequences showed 
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only 91% identity. P CI3379 insert represents a related 
pollen-specific gene. 

A second maize 211D genomic library was constructed in 
the vector lambda GSM-U, purchased from ?r omega, using the 
procedures described in the Promega manual. Screening this 
un-amplified library as above yielded clone GEM11-I, which 
hybridized to both 0.5 and 1.0 kb probes. The 20kb Hindlll 
fragment of GEM11-1, which also hybridized to both probes, was 
subcioned into the Hindlll site of pBluescript SK+ to yield 
PCI331SS. The DNA sequence of 4.1kb of pCIB3166 was determined 
(Fig. 36) and after accounting for six introns in the genomic 
clone, was 100% identical to the cDNA sequence of pCIB3168 and 
PCIB3169. Comparison of the pCIB3166 sequence to the 
Genbank/EMBL database revealed that the 5' portion, through the 
3 exon, was 34.5% identical to rat calmodulin-dependent protein 
kinase II at the amino acid level (Fig. 33), while the fourth 
through seventh exons were 39.4% identical to rat calmodulin. 
See Fig. 34. No other pollen-specific kinase has been 
described, and at the time this a protein combining kinase and 
calmodulin domains was unknown. Subsequently, Harper et al., 
Science 2S2: 951-954 (1991) have disclosed the cDNA sequence of 
a similar protein from soybean, although this gene is not 
pollen-specific in expression. Comparison of the soybean 
calcium-Dependent Protein Kinase (CDPK) and the maize pollen 
CDPK reveals 38% identity at the amino acid level. See Fig. 
35. 

Example 36. Identification of the Promoter's Transcriptional 
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Starr Site by Primer Extension 

Oligonucleotide ?E51, with the following sequence was 
synthesized as a primer. 

5 ' - TGGCCCATGGCTGCGGCGGGGAACGAGTGCGGC - 3 ' 

Primer extension analysis was carried out on polyA+ 
pollen mRNA as described in Metraux et al., PNAS USA 86:896-890 
(1989) . The transcription initiation site was determined to be 
between bases 1415 and 1425 on the partial sequence of pCI33166 
shown in Fig, 36. 

Testing Promoter Function in Transgenic Plants 

Example 37. Construction of promoter vectors for plant 
transformation 

To demonstrate -hat the pollen CDPK promoter can drive 

expression of a linked gene in transgenic plants r a gene fusion 

of the pollen CDPK promoter to the Beta-glucuronidase gene of 

5. coli was constructed as follows. The lOkb BamHI fragment 

from lamoda GEMli-i containing the first exon and part of the 

first intron of the pollen CDPK gene plus 9 Jcb upstream of the 

gene was subcloned into the BamHI site of pBluescript SK+ to 

create plasmid pCIB3167. The 2.3 Jcb BamHI -Hindlll fragment 

from pCIB3167 was subcloned into the 3amHI and Hindlll sites of 

pBluescri|$t SK+ to create plasmid pSK105. The pSKlOS was 

digested with Aval arid Hindlll, and thel.75 kb Hindlll-Aval 

fragment was isolated on an agarose gel . A PCS reaction was 

run under standard conditions as described in Sambrook et al. 

using intact pSK105 as a template and the following primers: 

♦42 : 5 ' -AGCGGTCGACCTGCAGGCATGCGATCTGCACCTCCCGCCG- 3 ' 
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#43 : 5' -ATGGGCAAGGAGCTCGGG-3 

The PCS. reaction products were digested with Aval and 
Sail and the resulting fragment isolated on an agarose gel. 
pBluescript SK+ was digested with Hindlll and Sail. The 1.75 
kb Hindlll-Aval fragment, PCS. derived Aval-Sail fragment, and 
pBluescript vector with Hindlll and Sail ends were ligated in a 
three way ligation to create plasmid pSKHO. 

A fusion of the promoter fragment in pSKHO to the 
Beta-glucuronidase (CDS) gene was created by digesting pSKllO 
with Hindlll and Sail, isolating the i.Skb fragment on an 
agarose gel and ligating it into Hindlll and Sail sites of 
pCIB3054, to create plasmid pKL2, a plasmid derived from ?OC19 
containing the GUS gene followed by plant intron from the maize 
?EPC gene and a polyA signal from cauliflower mosaic virus. 
This promoter fusion was inactive in plants, probably due to 
the presence of out of frame ATG codons in the leader sequence 
preceding the GUS gene ATG. 

A function fusion of the promoter was created by 
digesting ?KL2 with Xbal and Sail to remove the previous fusion 
junction. A new fusion junction was produced in a PCS. reaction 
using pSKlOS as a template and the following primers: 
#SK50: 5' -CCCTTCAAAATCTAGAAACCT-3' 
ISK49: 5 ' -TAATGTCGACGAACGGCGAGAGATGGA- 3 ' 

The PCS product was digested with Xbal and Sail and 
purified on an agarose gel. The purified fragment was ligated 
into the Xbal and Sail sites of pKL2 to created plasmid 
?CIB3171. This plasmid contains a functional fusion of pollen 
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CDPK promorer and GUS which directs expression the GUS gene 
exclusively in pollen. 

To create a vector containing the pollen CDPK 
promoter-GDS fusion suitable for use in Acrobacterium 
tumefaciens- mediated plant transformation, the fusion gene was 
isolated from ?CIB3171 by digestion with Hindi 1 1 and Sail. The 
resulting fragment was ligated into the Hindi 1 1 and Sail sites 
of pBIlOl (purchased from Clontech) to create plasmid pCIB3175. 
Example 38. Production of Transgenic Plants 

pCIB3175 was transformed into Agrobacterium tumefaciens 
containing the helper plasmid pCIB542, and the resulting 
culture used to transform leaf disks from tobacco shoot tip 
cultures as described by Horsch et al., Science 227:1229-1231 
(1985) except that nurse cultures were omitted and selection 
was on 100 mg/1 Jcanamycin. Transgenic plants were regenerated 
and verified for presence of the transgene by PGR. 
Example 39. GUS Gene Expression Analysis 

Pollen from primary transf ormants and their progeny 
were analyzed his tochemi call y for expression of the GUS gene as 
described by Guerrero et al., Mol. Gen. Genet. 224:161-168 
(1990) . The percentage of pollen grains expressing the GUS 
gene, as demonstrated by blue staining in the X-gluc buffer , is 
shown in the table below. 

Plant Number % Blue Pollen 

PP1-51 28% 

PP1-54 54% 

PP1-55 none 

PP1-61 very few 

PP1-53 51% 
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PP1-67 
PP1-80 
PP1-33 



15% 
10% 
12% 



Primary transformants in which, a single pollen CDPK 
promoter-GUS gene was integrated would produce a maximum 50% 
GUS positive pollen due to segregation of the single gene. 

Flouometric GUS assays were done on pollen, stem, root, 
leaf and pistil tissue of selected plants to demonstrate the 
specificity of pollen CDPK promoter expression. Assays were 
performed as described in Jefferson, Plant Mol. Biol. 
14:995-1006 (1990), and GUS activity values are expressed as 
nmoles MU/ug protein/minute. 



Plant 
number 



Tissue GUS Activity 



PP1-51 stem 
leaf 



0.01 

0 

0.15 



root 
pistil 0 . 02 
pollen 0.24 



PP1-54 stem 
leaf 



0.01 

0 

0.13 



root 
pistil 0.01 
pollen 0 . 60 



PP1-63 stem 
leaf 



0.01 
0 

0.07 



root 
pistil 0.01 
pollen 0.57 



Untransformed 
Plant GUS 
Activity 

0.02 

0 

0.10 
0.01 
0.02 

0.02 

0 

0.1 

0.01 

0.02 

0.02 

0 

0.1 

0.01 

0.02 



Net GUS 
Activity 



0 
0 

0.05 
0.01 
0.22 

0 
0 

0.03 

0 

0.56 

0 
0 
•0 
0 



0.55 



Examples 40-50- are directed primarily to the preparation of 
chimeric constructs, i.e. recombinant DNA molecules, containing 
constitutive, tissue-preferred, or tissue- specific promoters 
operably linked to an instant 3.t. gene, insertion of same into 
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vectors, production of transgenic platns containing the 
vectors, and analysis of expression levels of B.t . proteins of 
the transgenic plants. 

EXAMPLE 40: CONSTRUCTION OF MAIZE OPTIMIZED BT 
TRANSFORMATION VECTORS 

To demonstrate the effectiveness of the synthetic Bt 
crylA(b) gene in maize, the PepC and pith specific promoters 
are fused to the synthetic Bt crylA(b) gene using PCR. 
Oligomers designed for the PCR. fusions were: 

(PEPC) 

KE99A28 - 5' -TGCGGTTACC GCCGATCACA.TG-3' 

XE97A28 = 5' -GCGGTACCGC 6TCSACSCSS ATCCCGCGGC GGGAAGCTAAG-3' 

(PITH) 

KE100A28 = S'-GTCGTCG&CC GCAACA-3' 

KE98A28 - 5 ' -GCGGTACCGC GTTAACGCGG ATCCTGTCCG ACACCGGAC-3' 
KE104A28 = 5' -GATGTC3TCS ACCGCAACAC-3' 

XE103A28 » 5' -GCGGTACCGC GGATCCTGTC CGACACCGGA CGGCT-3' 

PCR primers are designed to replace the Nco I sites in 
the 5' untranslated leader region of each of these tissue 
specific genes (containing ATG translatiohal start sites) with 
Bam HI sites to facilitate cloning of the synthetic crylA(b) 
gene into this Bam HI site . Subsequent construction of vectors 
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containing the tissue specific promoters fused to the synthetic 
crylA(b) gene and also containing th 25S:PAT:35S marker gene 
involves several intermediate constructs. 

1. OCIB4406 (35S : synthetic-cry IA (b ) :pepC ivs#9:35S) 

PCIB440S contains the 2 Kb 3am HI\Cla I synthetic 
crylA(b) gene fused with the CaMV 35S promoter (Rothstein et 
al., Gene 53:153-161 (1987)). The gene also contains intron #6 
derived from the maize PEP carboxylase gene (ivs#9) in the 3' 
untranslated region of the gene, which uses the CaMV 3' end. 
( PNAS USA , 83:2884-2888 (1986), Hudspeth et al., Plant 
Molecular Biology , 12: 579-589 (1989)). pCIB4406 is ligated 
and transformed into the "SURE" strain of Ej, coli cells 
(Stratagene, La Jolla, CA) as described above. One mutation is 
found in pCIB4406 / s crylA(b) gene at amino acid #436 which 
resulted in the desired Phe being changed to a Leu. pCIB4406 
is fully active against European com borer when tested in 
insect bioassays and produces a CrylA(b) protein of the 
expected size as determined by western blot analysis. 

2. ?CI34407 { 3SS: synthetic-cry IA <b ) :?epC ivs#9:35S + 

25S:PAT:3£S) 

pCI34407 is made from an approximately 4 Kb Hind 
IIINEco SI fragment containing" the 35S :PAT:35S gene, and the 
3.1 Kb \ Hind III\Eca RI 35S : synthetic-cry IA (b ) :35S gene from 
PCIS4406. pCI34407 is ligated and transformed into "SURE", 
DHSalpha, and HB101 strains of E^ coll using standard 
procedures (SambrooJc et al.) . The synthetic crylA(b) gene has 
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the same properti s as its precursor ?CIB4406. 
2. ?CIB4416 OSS : synthetic-cry LA (b) :pepC ivs#9:3SS + 
35S:PAT:35S + 35S:Adh intron : GUS : 3SS . ) 

PCIB4407 is cur with Eco RI and created with calf 
intestinal alkaline phosphatase (CI?) under standard conditions 
(Sambrook et al.) to produce an about 7.2 Kb fragment that is 
ligated with a 3.4 Kb Eco RI 35S:Adh\GUS:35S fragment to 
produce pCIB4416. Ligations and transformations into "SURE" 
cells is as described above. The synthetic crylA(b) gene in 
?CI34416 has the same properties as the gene in pCIB4406. 

4. pCIB4418 (35S: synthetic-cry IA(b) :pepC ivs#9:3SS) 

PCIB4406 is digested with Apa I and 3am HI and treated 
with CIP. pCIB4406 is digested with 3am HI and Nsp I. 
pBS123#13 is digested with Nsp I and Apa I. A three-way 
ligation is made consisting of a 4.3 Kb Apa I\Bam HI fragment 
from pCIB4406, a 1.3 Kb Bam HI\Nsp I fragment from pCIB4406, 
and a 170 bp Nsp I\Apa I fragment from pBS123#13 to form 
PCIB4418. The host EL coli strain for pCIB4418 is HB101. 

5. pCI34419 (35S:synthetic-cryIA<b) :?epC ivs*9:35S + 

35S:PAT:35S + 3SS:Adh intron : GUS :35S . ) 
PCI3441S and pCIB4418 are digested with Bst E II and 
Eco NI and fragments of pCI34416 are treated with CIP . A 9.1 
Kb fragment from pCIB441S ligated to a 1.4 Kb fragment from 
PCIB4418 to form pCIB4419. pCI34419 transformed in HB101 • 
competent 3^ coli cells demonstrates full activity in insect 
bioassays against European corn borer. 
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6. PC134420 (Pith:synthetic-cryIA(b) :PEPC ivs#9:35S + 
35S:PAT:25S) 

Intermediate constructs in making pCIB4420 are pBTinl, 
P Btin2, p4420A and pfltin3. pBtinl (pita promoter: second half 
of the synthetic 3t gene + 35S:PAT:35S) is made by ligating the 
2.1 Kb Xba I\Nco I pith promoter fragment from plasmid 
pith (3-1) with a 5.2 Kb Xba I\Nco I fragment from pCXB4407. 
pBtin2 is an intermediate construct containing the pith 
promoter modified with a 210 bp PCR fragment made using primers 
KE100A28 and KB98A28 listed above. The PCR reaction mix 
contains approximately 100 ng of a 2.1 Kb Bam HI\Nco I pith 
promoter fragment with 100 pmol of each oligomer, 200 nM of 
each dNTP, 1 X buffer (Catus) and 2.5 units of thermal stable 
polymerase. Since the Tm is relatively low (between 40- and 
50 »C) , PCR reactions are run with the following parameters: 

denaturation cycle: 94 »C for 1 minute 

annealing cycle : 37 *C for 1 minute 

extension cycle : 72-C for 45 seconds (+ 3 seconds per 

cycle) 

number of cycles: 25 

PCS. reactions are treated with proteinase K as 
described above prior to cutting with Sal I\Kpn I followed by 
phenol\chioroform extraction and ethanol precipitation as 
described above. The 210 bp fragment is purified on a 2 % 
Nusieve gel and extracted from the gel using Millipore's filter 
units. Th 210 bp Sal I\Kpn I fragment is ligated to the 4.9 
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Kb Sal I\Kpn I fragmenr from pith (3-1) to make pBtin2. p4420A 
(pith:synth tic-Bt:Pep inrron:2SS + 35S:PAT:35S) is made with a 
three-way ligation consisting of a 700 bp Nsi I\Bam HI fragment 
from pBtin2, a 1.8 Kb Bam HI\Bst 3 II fragment from pCIB4418, 
and a 5.9 Kb Bst 3 II\Nsi I fragment from pBtinl. After p4420A 
is made three mutations are discovered in pBtin2. A second PCR 
fragment is made to modify the Nco I site in the pith leader 
using primers KE104A28 and KE103A28 with Tm values around 65 *C. 
The PCR reaction mix is identical to that listed above with the 
addition of glycerol to 20% to reduce mutations in G+C rich 
areas (Henry et al., Plant Molecular 3ioloav Reporter 
9(2) :139-144, 1991). PCR parameters are as follows: 
File I: 94 # C : 3 minutes , 1 cycle 
File II: 60 # C : 1 minute 

94 # C : 1 minute 

25 cycles 

File III: 72 *C : 5 minutes r 1 cycle 

PCR reactions are treated as above and cut with 
restriction endonucleases Sal I and Kpn I. The 210 bp Sal 
I\Kpn I PQ (glycerol in the reaction) fragment is ligated to 
the 4.9 Kb Sal I\Kpn I. fragment from plasmid pith (3-1) to make 
pBtin3 . Sequence data on pBtin3~G#l shows this PCR generated 
fragment to be correct. 

pBtin3-G#l is used to make pCIB4420 (also called p4420B 
"G#6") . pCIB4420 is constructed with a three-way ligation 
using the 700 bp Nsi I\Bam HI fragment from pBtin3-G#l, a 1.8 
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Kb 3am HI\Bst S II fragment from ?CI34418, and a 5.3 Kb 3st E 
II\Nsi I fragment from pttial. ?CI34420 is used in mesophyll 
protoplast experiments and demonstrates full activity of the 
synthetic crylA(b) gene against European corn borer. 
7. pCIB4413 (PEPC:synthetic-Bt (Phe mutation) :PEPC intron:35S.) 

A fusion fragment is generated by PCR using primers 
KE39A28 and KE97A28 with a 2.3 KB Hind III\Sal I template from 
PGUS4.5. The PCS mix contains the same concentration of 
primers, template, dNTPs, salts, and thermal stable polymerase 
as described above. PCS. reaction parameters are: 

denaturation cycle : 9A*C for 1 minute 

annealing cycle : 55 # C for 1 minute 

extension cycle : 72-C for 45 seconds (+ 3 seconds per 

cycle) 

number of cycles: 30 

After completion, PCR reactions are treated with 
proteinase K followed by phenolNchioroform extraction and 
ethanol precipitation as described above prior to cutting with 
restriction endonucleases Bam HI and 3st E II . 

PCI34413 is made with a three-way ligation using the 
210 bp 3am HXNBst E II PCR fragment, a 4.7 Kb Bam HZ\Hind III 
fragment from pCI34406, and a 2.2 Kb Hind III\Bst E II fragment 
from pGUS4.5. 

8. pCIB4421 (PEPC: synthetic-cry IA(b) :PEPC intron:35S.) 

PCIB4421 is made to replace the synthetic crylA(b) gene 
containing the Phe mutation in pCIB4413 with the synthetic 
crylA(b) gene from pCIB4419. pCIB4421 is made by ligating a 
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5,2 Kb Bam HI \ Sac I fragment from ?CIB4413 with a 1.9 Kb Bam 
HI \ Sac I fragment from pCIB4419. 

9. pCIB4423 <PEPC;synthetic-cryIA(b) :PepC intron:3SS + 
35S:PAT:35S) 

The 2.4 Kb Bam HI \ Hind III ?EPC promoter fragment from 
PCIB4421 is ligated to the 6.2 Kb 3am HI\Hind III fragment in 
?CIB4420 to make pCIB4423. The Hind III site is deleted by 
exonucieases in the cloning of pCIB4423. pCIB4423 contains the 
synthetic crylA(b) gene under the control of the PEPC promoter, 
and the PAT gene under the control of the 3SS promoter, 

10. Synthetic cryIA<b) gene in Aarobacterium strains: 

Acrobacterium strains made with the synthetic cry LA (b) 
gene allow transfer of this gene in a range of dicotyledenous 
plants. Aarobacterium vector pCIB4417 contains the 3.3 Kb Hind 
IIIXEco Rl 35S: synthetic-Cry IA<b) :PepC:ivs#9:35S fragment from 
pCIB4406 (Phe mutation) ligated to the 14 Kb Hind IIIXEco RI 
fragment from pBHOl (Clontech) . Using electroporation, 
PCIB4417 is transferred into the A. cmnefaciens strain LBA4404 
(Diethard at al., Nucleic Acids Research . Voll7:#16:6747, 
1989.) . 

200 ng of pCIB4417 and 40 ul of thawed on ice LBA4404 
competent cell are electroporated in a pre-cooled 0.2 cm 
electroporation cuvette (Bio-Rad Laboratories Ltd.). Using 
Gene Pulser-TM with the Pulse Controller unit (Bio-Rad) , an 
electric pulse is applied immediately with the voltage set at 
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2.5 kW and the capacity set at 25 uf . After the pulse, cells 
are immediately transferred to 1 ml of YSB medium and shaken at 
27 C for 3 hours before plating 10 ul on ABmin:Xm50 plates. 
After incubating at 28 C for approximately 60 hours colonies 
are selected for miniscreen preparation to do restriction 
enzyme analysis. The final Acrobacterium strain is called 
pCIB4417:LBA4404. 

EXAMPLE 41: ELISA ANALYSIS OF TRANSFORMED MAIZE PROTOPLASTS 

The presence of the crylA(b) toxin protein is detected 
by utilizing enzyme-linked immunosorbent assay (ELISA) . ELISAS 
are very sensitive, specific assays for antigenic material. 
ELISA assays are useful to determine the expression of 
polypeptide gene products. Antiserum for these assays is 
produced in response to immunizing rabbits with 
gradient-purified 3t crystals [Ang et ai., Applied Environ. 
Microbiol w 36:625-626 (1978)] solubilized with sodium dodecyl 
sulfate. ELISA analysis of extracts from transiently 
transformed mai2e cells is carried out using standard 
procedures (see for example Harlow f E., and Lane, D. in 
"Antibodies: A Laboratory Manual", Cold Spring Harbor 
Laboratory Press, 1988) . ELISA techniques are further 
described in Clark et al., Metho ds; in Snzvmoloay, 118:742-766 
(1986); *nH Bradford/. Anal, Biochem. , 72:248 (1976) . Thus, 
these procedures are well-fcnown to those skilled in the art. 
The disclosure of these references is hereby incorporated 
herein by reference. 

SLISA assays are performed to detect the production of 
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CrylA(b) protein in maize protoplasts. Protein produced is 
reported below as ng of Bt per mg total protein (ng 3t/mg) . 
Each construct was tested twice. 
PCIB3069 No detectable at (both tests) 
PCIB4407 21,900 ng Bt/mg total protein, 
21,000 ng 3t/mg total protein 
The transformed maize cells produce high levels, on the 
order of approximately 20,000 ng of Bt CrylA(b) protein per mg 
total soluble protein, of the Bt I? when transformed with the 
maize optimized Bt gene. The level of detection of these SLISA 
based assays is about 1 to 5 ng CrylA(b) protein per mg 
protein. Therefore, the maize optimized Bt gene produces as 
much as approximately a 20,000 fold increase in expression of 
this protein in maize cells. 

EXAMPLE 42: ASSAY OF EXTRACT FROM TRANSFORMED PROTOPLASTS FOR 
INSECTICIDAL ACTIVITY AGAINST EUROPEAN CORN BORER 

Western blot analysis is also performed using extracts 

obtained from maize cells which had been transiently 

transformed with DNA to express the maize optimized gene. When 

examined by western blots, this protein appears identical with 

the protein produced in call . In contrast, as demonstrated 

in Example 6 above, no detectable at crylA(b) insecticidal 

protein is produced by maize cells transformed with comparable 

vectors attempting to express the native 3t derived coding 

region. 

Qualitative insect toxicity testing can be carried out 
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using harvested protoplasts. Suspensions are prepared for each 
replicate tested in all bioassays. A replicate is considered 
positive if it causes significantly higher mortality than the 
controls. For example, replicates are tested for their 
activity against insects in the order Lepidootera by using the 
European com borer, Qstrinia nubilalis. One-hundred ul of a 
protoplast suspension in 0.1% Triton x-100 is pipetted onto the 
surface of artificial Blacfc cutworm diet, (Bioserv, Inc., 
Frenchtown, NJ; F9240) in 50 mm X 10 mm snap-cap oetri dishes. 
After air drying 10 neonatal larvae are added to each plate. 
Mortality is recorded after about 4 days. When this protein is 
fed to European corn borers, it produces 100% mortality. 

EXAMPLE 43: EXPRESSION OF SYNTHETIC 3T IN MAIZE MESOPHYLL 
PROTOPLASTS 

The general procedure for the isolation of corn 
mesophyll protoplasts is adapted from Sheen et al. f The Plant 
Cell , 2:1027-1038 (1990) . The protoplast transformation system 
used in Sheen et al. is modified by using PEG mediated 
transformation, rather than electroporation . That procedure, 
as well as changes made in the isolation procedure, is 
described below. 

Maize Mesophyll Protoplast Isolation/Transformation 

1. Sterilize and germinate com seeds for leaf 
material. Seedlings are grown in the light at 25C. 

2. Surface sterilize leaf pieces of 10-12 day old 
seedlings with. 5% Clorox for 5 minutes followed by several 
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washes with sterile distilled water. 

3. Aliquot enzyme solution (see recipe below); 25 
ml/dish (100x25 mm petri dish) . 

4. Remove any excess water from leaves and place 6-8 2 
inch pieces in each dish of enzyme. 14 plates are usually set 
up with the leaf material from about 100 seedlings. 

5. Cut leaves in longitudinal strips as thin as 
possible (2-5 mm) . 

6. Shake slowly at 25C for 6.5 to 7 hours. Cover plates 
so that incubation takes place in the dark. 

7. Before filtering protoplasts, wash 100 um sieves 
with 10 ml 0.6 M mannitol. Pipet protoplasts slowly through 
sieves. Wash plates with 0.6 M mannitol to gather any 
protoplasts left in the dishes. 

8. Pipet filtered liquid carefully into 50 ml sterile 
tubes. Add equal volumes of 0,6 M mannitol to dilute. 

9. Spin for 10 minutes at 1000 rpm/500 g in table-top 
centrifuge (Beckman Model TJ-6) . 

10 . Remove enzyme solution and discard. Resuspend 
pellets carefully in 5 ml mannitol . Pool several pellets . Bring 
volume to 50 ml with 0.6 M mannitol and spin. 

11. Resuspend to a known volume (50 ml) and count. 

12. After counting and pelleting* resuspend protoplasts 
at 2 million/ml in resuspending buffer (recipe below) . Allow 
opts to incubate in the resuspending buffer for at least 30 min 
before transformation. 

Transformation: 
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1. Aliquot plasmids to tubes (Fisherbrand polystyrene 
17 x 100 mm Snap Cap culture tubes) ; at least three replicates 
per treatment; use equimolar amounts of plasmids so that equal 
gene copy numbers are compared. 

2. Add 0.5 ml protoplasts and 0.5 ml 40% PEG made with 

0.6 m mannitol. 

3. Snake gently to mix and incubate at 25C for 30 min. 

4. Add protoplast culture media at 5 min intervals: 

1,2,5 ml 

5. Spin for 10 min at 1000 rpm/500 g. 

6. Remove liquid from pellet and resuspend in 1 ml 
culture media (BMV media) 

7. Incubate overnight at 25C in the dark. 

Recipes: 
Enzyme Solution 
0.6 M mannitol 
10 mM MES, pH 5.7 
1 mH Cad^ 
1 mM MgC^ 
0.1% 3SA 
filter-sterilize 

To this solution, add the following enzymes: 
1% Cellulase RS, and 0.1% Macerozyme R10 

Wash Buffer: 0.6 M mannitol, filter-sterilize 
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Resuspending 3uffer: 0.6 M mannitol, 20 mM KC1, 
filter-sterilize 

Culture Media: BMV media recipe from: 

Okuno et al., Phytopathology 67:610-615 (1977). 

0.6 M mannitol 

4 raM MES, pH 5.7 

0. 2 mM KH-PO- 

2 4 

1 mM KNO-, 

1 mM MgS0 4 

10 mM CaCl 2 

IX K3 micronutrients 

filter-sterilize 

SLISA analysis of transformed protoplasts is done one 
day after transformation. ELISA's are done as previously 
described. Tlie following tliree experiments are done with maize 
inbred line 211D. Of course, other lines of maize may be used. 
50 ug of plasmid pCIB4419 and equimolar amounts of other 
plasmids are used. Total soluble protein is determined using 
the BioRad protein assay. (Bradford, Anal.Biochem , 72:248 
(1976) . 

Transformation Experiment : 
Constructs tested: 

1. ?CI34419 (Construct contains synthetic* Bt under control of 
CaMV 25S promoter and 35S/PAT and 25S/GOS marker genes) 

2. pCI34420 (Construct contains synthetic Bt under control of 
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Pita promoter and PAT marker gene) 

3. pCIB4421 (Construct contains synthetic Bt under control of 
PBEC promoter) 

4. pCIB4423 (Construct contains synthetic Bt under control of 
PEPC promoter and PAT marker gene) 

(PEPC : synthetic-cry (b) :PepC intron:35S + 3SS:PAT:35S) 

In the following experiments, 10 or 11 day old 211D 
seedlings are analyzed for production of tiie Bt CrylA(b) 
protein in the Biorad protein assay: 

Experiment 1 (11 day seedlings) : 

PCIB4419 15,000 ± 3,000 ng 3t/mg protein 

pCIB4420 280 ± 65 ng Bt/mg protein 

PCIB4421 9,000 * 300 ng 3t/mg protein 

Experiment 2 (10 day seedlings) : 

PCIB4419 5,000 ± 270 ng 3t/mg protein 

pCIB4420 80 ± 14 ng Bt/mg proteS-n 

pCIB4421 1,500 * 220 ng Bt/mg protein 



Experiment 3 (11 day seedlings) : 



PCIB4419 



21,500 ± 1,800 ng Bt/mg protein 
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260 ± 50 ng 3t/mg protein 
11,900 ± 4,000 ng 3t/mg protein 
7,200 i 3,400 ng 3t/rag protein 

The above experiments confirm that both the CaMV 35S 
and PEPC promoters express the synthetic 3t CrylA(b) protein at 
very high levels. The pith promoter, while less efficient, is 
also effective for the expression of synthetic CrylA(b) 
protein • 

EXAMPLE 44: STABLE EXPRESSION OF SYNTHETIC 3T IN LZTTOCS 

The synthetic 3t gene in the Aarobacterium vector 
PCIB4417 is transformed into Lactuca sativa cv. Redprize 
(lettuce) . The transformation procedure used is described in 
Enomoto et al.. Plant Call Reports , 9:6-9 (1990). 
Transformation procedure: 

Lettuce seeds are suface sterilized in 5% Clorox for 5 
minutes followed by several washes in sterile distilled water. 
Surface-sterilized seeds are plated on half strength MS media 
(Murashige and Skoog, Physiol. Plant . 15:473-497 (1962)). 
Cotyledons of 6-day-old Redprize seedlings, grown under 
illumination of 3,000 Ix 16 hr at 2SC r are used as the explants 
for Aarobacterium infection. The base and tip of each 
cotyledon are removed, with a scalpel. The explants are soaJced 
for 10 minutes in the bacterial solution which have been 
cultured for 48 hours in AB minimal media with the apropriate 
antibiotics at 28C. After blotting excess bacterial solution 
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on sterile filter paper, the explants are plated on MS media 
(0.1 mg/1 3A and 0.1 mg/1 NAA) for 2 days. Explants are then 
transferred to selective media containing 500 mg/1 
carbenicillin and 50 mg/1 kanamycin. The explants are 
sub cultured to fresh media weekly. The growth chamb er 
conditions are 16 hour 2,000 lx light at 25C. After 
approximately 4 weeks, an BLISA is done on healthy looking 
callus from each of four plates being sub cultured. The SLISA 
procedure is the same as described above for protoplasts; 
soluble protein is again determined by the 3iorad assay 
described above. 
Results: 

PCIB3021 (Jean control) 0 
pCIB4417 (plate 1) 0 

pCIB4417 {plate 2) 505 ng Bt/mg protein 

PCI34417 (plate 3) 45 ng Bt/mg protein 

PCIB4417 (plate 4) 1,200 ng Bt/mg protein 

This example demonstrates that dicot plants can also show 
increased expression of the optimized insecticidal gene. 
Example 45. Construction of pCIB4429. 

PCXB4429 contains a preferred maize pollen-specific 
promoter fused with the ssaize optimized crylA(b) gene. The 
oollen-specif ic. staize promoter used in this construct was 
obtained from the plasmid pKL2, described in Example 37. The 
maize optimized crylA(b) gene was obtained from plasmid 
pCI34418, also described in Example 37. 

pKL2 is a plasmid that contains a preferred maize 
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pollen-specific promoter fused with the coli 
beta-glucuronidase gene. It was constructed from plasmids 
pSKHO and ?CIB3054. pSKHO contains the pollen specific maize 
promoter. pCIB3054, a pUC19 derivative, contains the coli 
beta-glucuronidase (GUS) gene fused with the cauliflower mosaic 
virus (CaMV) 3SS promoter. It's construction is described 
elsewhere in this application. This promoter can be removed 
from this plasmid by cutting with Sall/Hindlll to yield a 
fragment containing the GUS gene, a bacterial ampicillin 
resistance gene and a ColEI origin of replication. A second 
fragment contains the CaMV 35S promoter. 

PCIB3054 was cut with the restriction enzymes Sail and 
HindXXl, using standard conditions, for 2 hours at room 
temperature . The reaction was then extracted with 
phenol/ chloroform using standard conditions and the DNA 
recovered by ethanol precipitation using standard conditions. 
The recovered DNA was resuspended in buffer appropriate for 
reaction with calf intestinal alkaline phosphatase (CIP) and 
reacted with 2.5 units of CI? at 37 «c overnight. After the CIP 
reaction, the DNA was purified on an agarose gel using standard 
conditions described elsewhere in this application. pSKHO was 
cut with Sall/Hindlll under standard conditions for 2 hours at 
room temperature and the DNA subsequently purified on an 
agarose gel using standard conditions. The recovered DNA 
fragments were iigated using standard conditions for two hours 
at room temperature and subsequently transformed into competent 
S. coli strain HB101 cells using standard conditions. 
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Transformants were selected on L-agar containing 100 ug 
ampiciliin/ml. Transf ormanrs were characterized for the desired 
plasmid construct using standard plasmid mini -screen 
procedures . The correct construct was named pKL2 . 

To make ?CIB4429, a three way ligation was performed 
using standard conditions known to those in the art. The three 

fragments ligated were: 

1) a Hindlll/BamHI fragment from pCIB4418, of about 4.7 
kb in size, containing the crylA(b) gene, the bacterial 
ampicillin resistance gene, and the ColSI origin of replication 

2) a Hindlll/Xbal fragment from pKL2 of about 1.3 kb in 
size and containing the pollen specific promoter from maize 

3) a PCS. generated fragment derived from the pollen 
promoter with a 3amHI site introduced downstream from the start 
of transcription. This fragment is approximately 120 bp and 
has ends cut with the restriction enzymes Xbal/BamHI. 

The PCS. fragment was generated using a 100 ul reaction 
volume and standard conditions described above. The primers 
used were: 

SKS0: 5'-CCC TTC ASA ATC TAG AAA CCT*3' 

KE127: 5'-GCG GAT CCG GCT GCS GCG GGG AAC GA-3' 

The above primers were wixed in a PCS. reaction with 
plasmid pSKlOS, a. plasmid that contains the pollen specific 
promoter Sxoot smize. 

After \the ''?CSL reaction was complete, 10 ul of tlie 
reaction was run on an agarose gel, using standard condition, 
to make sure tie reaction produced the expected size product. 
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The remaining 90 ul was treated with proteinase X at a final 
concentration of 50 ug/nil for 30 aiin. at 37 »C. The reaction was 
then heated at S5 # C for 10 inin., then phenol/ chloroform 
extracted using standard procedures. The DNA was recovered from 
the supernatant by precipitating with two volumes of ethanol 
using standard conditions. After precipitation, the DNA was 
recovered by centrifuging in a microfuge. The pellet was rinsed 
one time with 70% ethanol (as is standard in the art), briefly 
dried to remove all ethanol, and the pellet resuspended in 17 
Ul TE buffer. 2 ul of 10X restriction enzyme buffer was added 
as were 0.5 ul Bamfil and 0.5 ul Xbal. The DNA was digested for 
i hour at 37«C to produce a DNA fragment cut with Xbal/BamHI. 
After digestion with the restriction enzymes, this fragment was 
purified on an "agarose gel composed of 2% NuSieve <FMC) /1% 
agarose gel. Millipore filter units were used to elute the DNA 
from the agarose using the manufacturer's specifications. After 
elution, the DNA was used in the three-way ligation described 
above . 

After ligation, the DNA was transformed into competent 
IL. coli strain HB101 cells using standard techniques. 
Transformants were selected on L-agar plates containing 
ampicillin at 100 ug/ml. Colonies that grew under selective 
conditions were' characterized for plasmid inserts using 
techniques standard in the art. 

EXAMPLE 46. Construction of pCIB4431, a vector for tissue 
specific expression of the synthetic crylA(b) gene in plants. 

pCIB4431 is a vector designed to transform maize. It 
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contains two chimeric Bt endotoxin genes expressible in maize. 
These genes are the PEP carboxylase promoter/synthetic-cry IA(b) 
and a pollen promoter / synthetic-cry IA(b) . The PBP 
carboxylase/ cry IA(b) gene in this vector is derived from 
PCIB4421 described above. The pollen promoter is also described 
above. Fig. 20 is a map of plasmid pCIB4431. ?CIB4431 was 
constructed via a three part ligation using the about 3.5 Kb 
Kpn I /Hind III fragment (containing 

pollen/synthetic-cry IA(b) from pCIB4429, the about 4.5 Kb Hind 
III/Eco RI (PEPC/synthetic-crylA (b) and the about 2.6 Kb Kpn 
I/Eco RI fragment from the vector 31uescript. 

Other vectors including the pollen promoter/ synthetic 
CrylA(b) chimeric gene include ?CIB4428 and pCIB4430. See Figs. 
21 and 22. pCIB4430 also contains the ?EPC/synthetic-Bt gene 
described above. 

EXAMPLE 47. Production of transgenic maize plants containing 
the synthetic maize optimized CrylA(b) gene 

The example below utilizes Biolistics to introduce DHA. 
coated particles into maize. cells, from which transformed 
plants are generated. 
Experiment KC-65 

Productien of transgenic maize plants expressing the synthetic 
crylA(b) gene uainy; a tissue-specific promoter. 

Tissue ' . 

Immature maize v^ryoy, approximately 1.5-2.5 mm in length, 
were excised from an ear of genotype 6N615 14-15 days after 
pollination. The mother plant was grown in the greenhouse. 



144 



WO 93/07278 



PCT/US92/08476 



Before excision, the ear was surface sterilized with 20% Clorox 
for 20 minutes and rinse 3 times with sterile water. 
Individual embryos were plated scutellum side in a 2 cm square 
area, 36 embryos to a plate, on the callus initiation medium, 
2DG4 + 5 chloramben medium (N6 major salts, 35 minor salts, MS 
iron, 2% sucrose, with 5 mg/1 chloramben, 20 mg/1 glucose, and 
10 ml G4 additions (Table 1) added after autoclaving. 

TABLE 1 - G4 Additions 



Ingredient per liter medium 

Casein hydrolysate 0.5 gm 

Proline 1.38 gm 

Nicotinic acid .2 mg 

Pyridoxine-HCl . 2 mg 

Thiamine-HCl .5 mg 

Choline-HCl . i mg 

Riboflavin .05 mg 

Biotin . i mg 

Folic acid .05 mg 

Ca pantothenate .1 mg 

p-aminobenzoic acid .05 mg 

312 .136 ug 



Bombardment 

Tissue was bombarded using the PDS-lOOOHe Biolistics 
device. The tissue was placed on the shelf 8 cm below the 
stopping screen shelf. The tissue was shot one time with the 
DNA/goid microcarrier solution, 10 ul dried onto the 
macrocarrier. The stopping screen used was hand punched at 
ABRU using 10x10 stainless steel mesh screen^ Rupture discs of 
1550 psi value were used. After bombardmsiit ; , the embryos were 

cultured in the dark at 25* C. ! . 

•'/•..• • • 
" V . ■ ■ . ■/ • 

Preoaraticn of DNA for delivery 
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The microcarrier was prepared essentially according to 
the instructions supplied with the 3iolistic device. While 
vortaxing 50 ul 1.0 u gold microcarrier, added 5 ui ?CI34431 
(1.23 ug/uD [#898] + 2 ul PCIB3064 0.895 ug/uU [#456] 

followed by 50 ul 2.5 M CaCl 2 , then 20 ul 0-1 M spermidine 
(free base, TC grade) . The resulting mixture was vortexed 3 
minutes and microfuged for 10 sec. The supernatant was removed 
and the icrocarriers washed 2 times with 250 ul of 100% EtOH 
(HPLC grade) by vortexing briefly, centrifuging and removing 
the supernatant. The microcarriers are resuspended in 65 ul 
100% StOH. 

Callus formation 

Embryos were transferred to callus initiation medium 
with 3 mg/1 PPT 1 day after bombardment. Embryos were scored 
for callus initiation at 2 and 3 weeks after bombardment. Any 
responses were transferred to callus maintenance medium, 2DG4 * 
0.5 2,4-D medium with 3 mg/L PPT. Callus maintenancs medium is 
N6 major salts, 35 minor salts, MS iron, 2% sucrose, with 0.5 
mg/1 2,4-D, 20 mg/1 glucose, and 10 ml G4 additions added after 
autoclaving. Embryogenic callus was subcultured every 2 weeks 
to fresh maintenancs medium containing 3 mg/L PPT. All callus 
was incubated in' the. dark at 25 »C. 

The Typs I callus formation response was 15%. Every 
embryo which, pstartubed callus was cultured as an individual 
event giving rise to an individual line. 
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Regeneration 

After 12 weeies on selection, the tissue was removed 
from callus maintenance edium with PPT and was placed on 
regeneration medium. Regeneration medium is 0.25MS3S5BA (0.25 
mg/1 2,4 D, 5 mg/1 aAP, MS salts, 3% sucrose) for 2 weeks 
followed by subculture to MS3S medium for regeneration of 
plants. After 4 to 10 weeks, plants were removed and put into 
GA 7's. Our line KCS5 0-6, which became the #175 BT event, 
produced a total of 38 plants. 

Assays 

All plants, as they became established in the GA7 / s, 
were tested by the chlorophenol red cat) test for resistance to 
PPT as described in U.S. Patent Application 07/759,243, filed 
Septefaber 13, 1991, the relevant portions of which are hereby 
incorporated herein by reference. This assay utilizes a pfi 
sensitive indicator dye to show. which cells are growing in the 
presence of PPT. Cells which grow produce a pH change in the 
media and turn the indicator yellow (from red) . Plants 
expressing the resistance gene to PPT are easily seen in this 
test. (#176 ~ 3 positive/30 negative) Plants positive by the 
CR test were assayed by PCR for the presence of the; synthetic 
BT gene. (#176 » 5 positive/2 negative/1 dead) 

Plants positive by PCR for the syn~3T gehe w^re sent to 
the phytotron. Once established in the phytofcron, they were 
characterized using insect bioassays and ELISA analysis. 
Plants were insect bioassayed using a standard European Corn 
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Borer assay (described in Example 5A) in which small pieces of 
leaf of clipped from a plant and placed in a small petri dish 
with a number of SCB neonate larvae. Plants are typically 
assayed at a height of about S inches. Plants showing 100% 
mortality to EC3 in this assay are characterized further. SLISA 
data are shown below. Positive plants are moved to the 
greenhouse . 

Greenhouse/Fertilitv 

Plant number #176-11 was pollinated with wild-type 
6NS15 pollen. One tassel ear and one ear shoot were produced. 
All of the embryos from the tassel ear (11) and 56 kernels from 
Ear 1 were rescued. 294 kernels remained on the ear and dried 
down naturally. 

Pollen from #176-11 was outcrossed to various maize 
genotypes 5N984, 5NA89 f and 3N961. Embryos have been rescued 
from all 3 outcrosses (5N984 = 45; 5NA89 = 30; 3N961 = 8) . 
Most of the kernels remained on the ears on the plants in the 
greenhouse and were dried down naturally. 

DNA was isolated from plant #176-11 using standard techniques 
and analysed by Southern blot analysis. It was found to 
contain sequences which hybridize with probes generated from 
the synthetic cry IA(b) gene and with a probe generated from the 
PAT gene. These results showed integration of these genes into 
the genome of maize. 



Experiment KC-64 
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Production of transgenic maize piants expressing the. synthetic 
crylA(b) gene using a constitutive promoter. 



Tissue 

Immature maize embryos, approximately 1.5-2.S mm in 
length, were excised from an ear of genotype 6N61S 14-15 days 
after pollination. The mother plant was grown in the 
greenhouse. Before excision, the ear was surface sterilized 
with 20% Clorox for 20 minutes and rinse 3 times with sterile 
water. Individual embryos were plated scutellum side in a 2 cm 
square area, 36 embryos to a plate, on the callus initiation 
medium, 2DG4 + 5 chloramben medium (N6 major salts, 35 minor 
salts, MS iron, 2% sucrose, with 5 mg/1 chloramben, 20 mg/i 
glucose, and 10 ml G4 additions Table 1) added after 
autociaving. 

TABLE 1 - G4 Additions 

Ingredient per liter medium 

Casein hydrolysate . 0.5 gm 

Proline 1.38 gm 

Nicotinic acid .2 mg 

Pyridoxine-HCl .2 mg 

Thiamine-HCl .5 mg 

Choline-HCl . 1 mg 

Riboflavin .05 mg 

Biotin .1 mg 

Folic acid .05 mg 

Ca pantothenate -1 mg 

p-aminobenzoic acid .05 mg 

312 .136 U g 

Bombardment 

Tissue was bombarded using the PDS-lOOQHe Biolistics 
device. The tissue was placed on the shelf 8 cm below the 
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stopping screen shelf. Tne tissue was shot one time with the 
DNA/gold micro-carrier solution, 10 ul dried onto the 
macrocarrier. The stopping screen used was hand punched at 
ABRU using 10x10 stainless steel mesh screen. Rupture discs of 
1S50 psi value were used. After bombardment, the embryos were 
cultured in the dark at 25 • C. 



Preparation of DNA for delivery 

The microcarrier was prepared essentially according to 
the instructions supplied with the Biolistic device. While 
vortexing 50 ul 1.0 u gold microcarrier, added 3.2 ul pCIB4418 
(0.85 ug/ul) [#905] + 2 ul PCIB3064 0.895 ug/ul) [#456] + 

l.S ul PCIB3007A (1.7 ug/ul) [US2] followed by 50 ul 2.5 M 
CaCl 2 , then 20 ul 0.1 M spermidine (free base, TC grade). The 
resulting mixture was vortsxsd 3 minutes and microfuged for 10 
sec. The supernatant was ixnoved and the microcarriers washed 
2 times with 250 ul of 100% EtOH (HELC grade) by vortexing 
briefly, centrifuging and removing the supernatant. The 
microcarriers are resuspended in 65 ul 100% SfcOH. 

ran us formation 

Embryos were transferred za callus initiation medium 
with 3 mg/1 PPT 1 day after bombardment. Embryos were scored 
for callus initiation at 2 and 3 weeks after bombardment. Any 
responses were transferred to callus maintenance medium, 2DG4 + 
0.5 2,4-D medium with 3 mg/L PPT. Callus maintenance medium is 
N6 major salts, 35 minor salts, MS iron, 2% sucrose, with 0.5 
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mg/1 2,4-D, 20 rag/1 glucose, and 10 ml G4 additions added after 
utociaving. Embryogenic callus was subcultured every 2 weeks 
to fresh maintenance medium containing 3 mg/L PPT. All callus 
was incubated in the dark at 25 *C. 

The Type I callus formation response was 18%. Every 
embryo which produced callus was cultured as an individual 
event giving rise to an individual line. 

Regeneration 

After 12 weeks on selection, the tissue was removed 
from callus maintenance medium with PPT and was placed on 
regeneration medium and incubated at 25 • C using a 15 hour 
light (50 u£ ..m-2 . s-1) / 8 hour dark photoperiod. 
Regeneration medium is 0.25MS3S5BA (0.25 mg/1 2,4 D, 5 mg/1 
BAP, MS salts, 3% sucrose) for 2 weeks followed by subculture 
to MS3S medium for regeneration of plants. After 4 to 10 
weeks, plants were removed and put into GA 7's. Our line KC64 
0-1, which became the #170 BT event, produced 55 plants. Our 
line KCS4 0-7, which became the #171 BT event, produced a total 
of 33 plants. 

Assays 

Eleven plants, as they became established in the GA7's, 
were tasted by the chlorophenol red (CR) test for resistance to 
PPT as per Shiliito, et al, above. This assay utilizes a pfl 
sensitive indicator dye to show which cells are growing in the 
presence of PPT. Cells which grow produce a pH change in the 
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media and turn the indicator yellow (from red) . Plants 
expressing the resistance gene to PPT are easily seen in this 
test. Plants positive by tne CR test were assayed by PCS. for 
tne presence of the synthetic 3T gene. (Event 170 = 37 
positive/18 negative; #171 - 25 positive/8 negative) . 

Plants positive by PCR for the syn-3t gene were sent to 
the phytotron. Once established in the phytotron, they were 
characterized using insect bioassays and ELISA analysis. 
Plants were insect bioassayed using a standard European corn 
borer assay (see below) in which small pieces of leaf of 
clipped from a lant and placed in a small petri dish with a 
number of EC3 neonate larvae. Plants are typically assayed at a 
height of about 6 inches. Plants showing 100% mortality to ECB 
in this assay are characterized further. ELISA data are shown 
below. Positive plants are moved to the greenhouse. 

Basta screening 

Sight of the mature plants from the #170 event were 
selected for evaluation of Basta [Hoechst] resistance. On one 
middle leaf per plant, an area approximately 10-14 cm long X 
the leaf width was painted with 0, 0.4, 1.0 or 2.0% (10 ml of 
200 g/L diluted to 100 ml with deionized water) aqueous Basta 
containing 2 drops of Tween 20/100 ml. Two plants were tested 
per level. Eight wild-type SN615 plants of the same 
approximate age were treated as controls. All plants were 
observed at 4 and 7 days. All of the control plants eventually 
died. Throughout the study, none of the #170 plants displayed 
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any damage due to the herbicide. 

Pollination 

All tassel ears, first ear and, if available, the 
second ear on the #170 and #171 plants were pollinated with 
wild-type SN615 pollen. At least 90% of the plants were female 
fertile. 

Pollen from #171 plants was outcrossed to genotypes 
6N615, 5N984, 5NA89, 6F01Q, 5NA56, 2N217AF, 2ND01 and 3N961. 
At least 90% of the plants were shown to be male fertile. 

Smbrvo Rescue 

Embryos from the #171 event have been "rescued." 
Fourteen to 16 days after pollination, the ear tip with 25-50 
kernels was cut from the ear with a coping saw. Prior to 
cutting, the husks were gently peeled away to expose the upper 
portion of the ear. The cut end of the ear on the plant was 
painted with Captan fungicide and the husks replaced. The seed 
remaining on the plant was allowed to dry naturally. 

The excised ear piece was surface sterilized with 20% 
Clorox for 20 minutes and rinsed 3 times with sterile water. 
Individual embryos were excised and plated scutelium side up on 
35 medium [Gamborg] containing 2% sucrose. 35 itamins are 
added to the medium after autociaving. Four embryos were 
piated per GA7 container and the containers incubated in the 
dark. When germination occurred, the containers were moved to 
a light culture room and incubated at 25 • C using a 16 hour 
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lighr (50 u£ .m-2 . s-l) / 3 hour dark photoperiod. The 
germination frequency is 94%. 

Progeny from 15 plants of the #171 event and 2 of the 
#176 event were rescued using standard embryo rescue techniques 
and evaluated. All plants were evaluated by insect assay. 
Plants from the #171 event were also tested in the 
histochemical OTS assay. In both the insect assay and the GUS 
assay, the ratio of segregation of the transgenes was 1:1, as 
expected for a single locus insertion event. 

EXAMPLE 48. Analysis of transgenic maize plants 
3LISA ASSAY 

Detection of crylA(b) gene expression in transgenic 
maize is monitored using European corn borer (ECS) insect 
bioassays and SLISA analysis for a quantitative determination 
of the level of cryL&tts) protein obtained. 
Quantitative determination of crylA(b) IP in the leaves of 
transgenic plants was performed using enzyme- linked 
immunosorbant assays (SLISA) as disclosed in Clark M F, Lister 
r M, Bar-Joseph M: SLISA Techniques. In: Weissbach A, Weissbach 
H (eds) Methods in Snzvmoloay 118:742-766, Academic Press, 
Florida (1986) . Immunoaf f inity purified polyclonal rabbit and 
goat antibodies specific for the 3. r. hurinaiensis subsp. 
kurstaki I? were used to determine ng IP per mg soluble protein 
from crude extracts of leaf samples. The sensitivity of the 
double sandwich SLISA is 1-5 ng IP per mg soluble protein using 
50 ug of total protein per SLISA microtiter dish well. 

154 



WO 93/07278 



PCT7US92/08476 



Corn extracts were made by grinding leaf tissue in 



gauze lined plastic bags using a hand held ball-bearing 
homogenizer (AGDIA, Elkart IN.) in the presence of extraction 
buffer (50 mM Na 2 C0 3 pH 9.5, 100 mM NaCi, 0.05% Triton, 0.05% 
Tween, 1 mM PMSF and 1 uM leupeptin) . Protein determination was 
performed using the Bio-Rad (Richmond, CA) protein assay. 



trans formants described above were analyzed for the presence of 
crylA(b) protein using ELISA. These plants varied in height 
from 6 inches to about three feet at the time of analysis. 



Using the above procedure, the primary maize 



Plant 



Bt ng/mg soluble protein 



5/27/91 



176-8 

176-10 

176-11 

171-4A 

171-6 

171-8 

171-9 

171-13 

171-14A 

171-14B 

171-15 

171-16A 

171-16B 

171-18 

176-30 

171-32 

171-31 

171-30 

71-14 



0 

700 
760 
59 
50 
60 
280 
77 
43 
60 
55 
13 
19 
19 



980 
166 
370 



1160 



0 

1585 
2195 



#10 leaf 
1 leaf 




plant 171-16 
#9 leaf 
#1 leaf 



40 

120 
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EUROPEAN CORN 30RER ASSAY 

1. One to four 4 cm sections are cut from an extended leaf of a 
corn plant. 

2. Each leaf piece is placed on a moistened filter disc in a 50 
X 9 mm petri dish. 

3. Five neonate European corn borer larvae are placed on each 
leaf piece. (Making a total of 5-20 larvae per plant.) 

4. The petri dishes are incubated at 29.5 »C. 

5. Leaf feeding damage and mortality data are scored at 24, 48, 
and 72 hours. 

EXAMPLE 49. Expression of 3t endotoxin in progeny of 
transformed maize plants 

The transformed maize plants were fully fertile and 
were crossed with several genotypes of maize. Progeny from 
these crosses were analyzed for their ability to kill European 
corn borer (EC3) in a standard EC3 bioassay (described 
immediately above) as well as for the presence of the crylA(b) 
protein using SLISA as described above. The ability to kill ECS 
and the production of crylA(b) protein correlated. These traits 
segregated to the progeny with a 1:1 ratio, indicating a single 
site of insertion for the active copy of the synthetic gene. 
This 1:1 ratio was true for both the constitutive 
promoter/ synthetic-cry IA(b) plants and the tissue specific 
promoter/ synthetic-cry IA (b) plants (data not shown) . 

Fig. 23A is a table containing a small subset of the 
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torai number of progeny analyzed. This cable is representative 
of a number of different crosses. 

Insect assays were done with Diatrea saccharalis and 
Ostrinia nubilalis using leaf material (as described above) of 
transgenic progeny containing a maize optimized CrylA(b) gene. 
The results of these assays are shown in Fig. 23B. They 
demonstrate that the maize optimized CrylA(b) gene functions in 
transformed maize to provide resistance to Sugarcane borer and 
Ostrinia nubilalis . 

EXAMPLE 50. EXPRESSION OF THE CRY I A (b) GENE IN MAIZE POLLEN 

Progeny of the transformed maize plants containing the 
chimeric pollen promoter/ synthetic crylA(b) gene derived from 
PCIB4431 were grown in the field to maturity. Pollen was 
collected and analyzed for the presence of the crylA(b) protein 
using standard EL ISA techniquesd as described elsewhere. High 
levels of crylA(b) protein were detected in the pollen. 
Progeny from the 35S promoter/ synthetic cry LA (b) transformed 
plant were grown in the greenhouse. Pollen from these plants 
was analyzed using ELISA, and cryLA(b) protein was detected. 
Results are shown below in Figure 23C. 
It is recognized that factors including selection of 
plant lines, plant genotypes , synthetic sequences and the like, 
may also affect expression. 

EXAMPLE 51. EXPRESSION OF THE CRYIA(b) GENE FUSED TO A 
PITH-PREFERRED PROMOTER. 

?CIB4433 (Fig. 36) is a piasmid containing the maize 

optimized CrylA(b) gene fused with the pith-preferred promoter 
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isolated from maize. Tliis plasmid was constructed using a 
three-way ligation consisting of: 

1) OCI34418, cut with 3stSII and 3amHI; 1.8 Kb fragment 

2) pBtinl, cut witii Nail and BstSII; 5.9 Kb fragment; 
pBtial is described elsewhere in this application 

3) ?CR fragment VI- 151 was generated in a PCR reaction 
using standard conditions as described elsewhere in this 
application. 

PCS. primers utilized were: 

KE150A28: 5' -ATT CGC ATG CAT GTT TCA TTA TC-3' 

KE151A28: 5' - GCT GGT ACC ACG GAT CCG TCG CTT CTG TGC AAC AAC 

C-3' 

After the PCS. reaction, the DNA was checked on an 
agarose gel to make sure the reaction had proceeded properly. 
DNA was recovered from the PCR reaction using standard 
conditions described elsewhere and subsequently cut with the 
restriction enzymes NsiZ and Bamfil using standard condition. 
After cutting, the fragment was run on a 2% NuSieve gel and the 
desired band recovered as described elsewhere. The DNA was 
used in the ligation described above. 

After ligation (under standard condition) , the DNA was 
transformed into competent S^. coli cell. 

Transformation was carried out using microprojectile 
bombardment essentially as described elsewhere in this 
application. Embryos were transferred to medium containing 
lOaug/ml PPT 24 hours after microprojectile bombardment. 
Resulting callus was transferred to medium containing 40 ug/ml 
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PPT after four weeks. Plants were regenerated witiiout 
selection. 

A small sample of plants (2-5) was assayed by PCR for 
each event. Further codes were added to indicate different 
positions and distances of embryos with respect to the 
microprojectile bombardment device. Plants were sent to the 
greenhouse having the following codes: 

JS21A TOP Plants 3^t. PCR Positive 

JS21A MID Plants B.t . PCR Positive 

JS21C 30T Plants B.z . ?CR Positive 

JS22D MID Plants B.t. PCR Positive 

JS23B MID Plants B.t. PCR Negative (for control) 

Leaf samples from the regenerated plants were 
bioassayed for insecticidal activity against European corn 
borer as described in Example 48 with the results shown in Fig. 
23D . 

SLISA analysis of leaf samples to quantify the level of 
CrylA(b) protein expressed in the leaves was carried out as 
described in Example 48 with the results shown in Fig. 24E. 

Deposits 

The following plasmids have been deposited with the 
Agricultural Research Culture Collection (NRRL) (1818 N. 
University St., Peoria, IL S1S04) under the provisions of the 
Budapest Treaty: pCI34418, pCIB4420, pCI34429, pCI34-431, 
?CIB4433, PCI35601, pCI33lS6 and pCIB3171. 
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The present invention has been described with reference 
to specific embodiments thereof; however it will be appreciated 
that numerous variations, modifications, and embodiments are 
possible. Accordingly, all such variations, modifications and 
embodiments are to be regarded as being within the spirit and 
scope of the present invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Svola, Stephen V. 

Crossland, Lyle 0. 
Wright, Martha S. 
Merlin, Ellis J. 
Launis, Karen L. 
Rothstein, Steven J. 

(ii) TITLE OF INVENTION: SYNTHETIC DNA SEQUENCE HAVING ENHANCED 
INSECTICIDAL ACTIVITY IN MAIZE 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: CIBA-GEIGY Corporation 

(B) STREET: 7 Skyline Drive 

(C) CITY: Hawthorne 

(D) STATE: New York 

(E) COUNTRY: USA 

(F) ZIP: 10S32 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: 

(B) REGISTRATION NUMBER: 

<C> REFERENCE/DOCKET NUMBER: CGC 1577 /CIP 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (919)541-8615 

(B) TELEFAX: (919)541-8689 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4360 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus tlxuringiensis 

(B) STRAIN: KurstaJci HD-I 

(C) INDIVIDUAL ISOLATE: CrylA(b) insecticidal crystal 

protein gene 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTTAACACCC TGGGTCAAAA ATTGATATTT AGTAAAATTA GTTGCACTTT GTGCATTTTT 60 

TCATAAGATG AGTCATATGT TTTAAATTGT AGTAATGAAA AACAGTATTA TATCATAATG 120 

AATTGGTATC TTAATAAAAG AGATGGAGGT AACTTATGGA TAACAATCCG AACATCAATG 180 

AATGCATTCC TTATAATTGT TTAAGTAACC CTGAAGTAGA AGTATTAGGT GGAGAAAGAA 240 

TAGAAACTGG TTACACCCCA ATCGATATTT CCTTGTCGCT AACGCAATTT CTTTTGAGTG 300 

AATTTGTTCC CGGTGCTGGA TTTGTGTTAG GACTAGT7GA TATAATATGG GGAATTTTTG 360 

GTCCCTCTCA ATGGGACGCA TTTCTTGTAC AAATTGAACA GTTAATTAAC CAAAGAATAG 420 

AAGAATTCGC TAGGAACCAA GCCATTTCTA GATTAGAAGG ACTAAGCAAT CTTTATCAAA 480 

TTTACGCAGA ATCTTTTAGA GAGTGGGAAG CAGATCCTAC TAATCCAGCA TTAAGAGAAG 540 

AGATGCGTAT TCAATTCAAT GACATGAACA GTGCCCTTAC AACCGCTATT CCTCTTTTTG 600 

CAGTTCAAAA TTATCAAGTT CCTCTTTTAT CAGTATATGT TCAAGCTGCA AATTTACATT 660 

TATCAGTTTT GAGAGATGTT TCAGTGTTTG GACAAAGGTG GGGATTTGAT GCCGCGACTA 720 

TCAATAGTCGTTATAATGAT TTAACTAGGC TTATTGGCAA CTATACAGAT CATGCTGTAC 780 

GCTGGTACAA TACGGGATTA GAGCGTGTAT GGGGACC3GA TTCTAGAGAT TGGATAAGAT 840 

ATAATCAATT TAGAAGAGAA TTAACACTAA CTGTATTAGA TATCGTTTCT CTATTTCCGA 900 

ACTATGATAG TAGAACGTAT CCAATTCGAA CAGTTTCCCA ATTAACAAGA GAAATTTATA 960* 

CAAACCCAGT ATTAGAAAAT TTTGATGGTA GTTTTCGAGG CTC5GCTCAG GGCATAGAAG 1020 

GAAGTATTAG GAGTCCACAT TTGATGGATA TACTTAACAG TATAACCATC TATACGGATG 1080" 

- CTCATAGAGG AGAATATTAT TGGTCAGGGC ATCAAATAAT GGCTTCTCCT GTAGGGTTTT 1140 

1S2 
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CGGGGCCAGA 
G7A77G77GC 
GACC7777AA 
C77A7GGAAC 
: CGC7GGA7GA 
■ GA77AAGCCA 
GAGC7CC7A7 
CACAAATTAC 
7TAAAGGACC 
CAACC77AAG 
ACGC77C7AC 
GGAA77777C 
TAGGTTTTAC 
A7G7C7TCAA 
TAACCTTTGA 
C77C77CCAA 
CCAA777AG7 
AGAAAG7CAA 
T7AGAGGGA7 
AAGGAGGCGA 
GC7A7CCAAC 
ACCAA77AAG 
A7GCCAAACA 
CAAG7CCAA7 
GA7GTACAGA 
ATGGCCA7GC 
CAC7AGC7C3 



A77CAC7TT7 CCGCTATATG 
7CAAC7AGG7 CAGGGC3TG7 
7ATAGGGATA AA7AA7CAAC 
C7CC7CAAA7 7TGCCATCC3 
AATACCGCCA CAGAA7AACA 
TGTTTCAATG TT7CGTTCAG 
G77C7C77GG A7ACA7CG7A 
ACAAATACC7 77AACAAAA7 
AGGA777ACA GGAGGAGATA 
AG7AAA7A77 AC7GCACCA7 
CACAAA7TTA CAATTCCATA 
AGCAAC7A7G AG7AG7GGGA 
7AC7CCGTTT AAC77T7CAA 
77CAGGCAAT GAAG777A7A 
GGCAGAATAT GA777AGAAA 
7CAAA7CGGG 77AAAAACAG 
7GAG7G7T7A 7CTGATGAA7 
ACA7GCGAAG CGACTTAG7G 
CAA7AGACAA C7AGACCG7G 
7GACG7AT7C AAAGAGAATT 
G7A77TA7A7 CAAAAAA7AG 
AGG57ATATC GAAGATAGTC 
CGAAACAG7A AA7G7GCCAG 
CGGAAAATG7 GCCCATCATT 
C7TAAATGAG GAC7TAGGIG 
AAGAC7AGGA AA7C7AGAA7 
7G7GAAAAGA GCGGAGAAAA 



GAAC7A7GGG AAA7GCAGC7 
A7AGAACA77 A7CG7CCAC7 
AAC7A7C7G7 7C77GACGGG 
C7G7A7ACAG AAAAAGCGGA 
ACG7GCCACC 7AGGCAAGGA 
GC7TTAG7AA 7AG7AG7G7A 
G7GC7GAA77 7AA7AATATA 
C7AC7AA7C7 7GGCTCTGGA 
77C77CGAAG AAC7TCACC7 
7A7CACAAAG A7A7CGGG7A 
CA7CAAT7GA CGGAAGACC7 
G7AA77TACA G7CCGGAAGC 
A7GGATCAAG 7G7ATTTACG 
7AGA7CGAAT 7GAATT7GTT 
GAGCACAAAA GGCGG7GAAT 
A7G7GACGGA 7TA7CA7A77 
T77G7C7GGA 7GAAAAAAAA 
A7GAGC3GAA 777AC77CAA 
GC7GGAGAGG AAG7ACGGA7 
ACG77ACGC7 A77GGG7ACC 
A7GAG7CGAA A77AAAAGCC 
AAGAC77AGA AA7CTA7TTA 
G7ACGGG77C C7TA7GGCCG 
CCCA7CA777 C7CC77GGAC 
7A7GGG7GA7 A7TCAAGATT 
77C7CGAAGA GAAACCAT7A 
AA7GGAGAGA CAAACG7GAA 
1S3 
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2880 



3120 
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3300 
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3420 



GGGAAACAAA TATTGTTTAT AAAGAGGCAA AAGAATC7GT AGATGCTTTA 7TTGTAAACT 
CTCAATATGA TAGATTACAA GCSGATACCA ACATCGCGAT GATTCATGCG GCAGATAAAC 
GCGTTCATAG CATTCGAGAA GCTTATCTGC C7GAGC7GTC TGTGATTCCG GG7GTCAATG 2940 
CGGCTATTTT TGAAGAATTA GAAGGGCGTA TTTTCACTGC ATTCTCCCTA TATGATGCSA 3000* 
GA&ATGTCAT TAAAAATGGT GAT7TTAATA ATGGCTTATC CTGCTGGAAC G7GAAAGGGC 30 60 
ATGTAGATGT AGAAGAACAA AACAACCACC GTTCGGTCCT TGTTGTTCCG GAATGGGAAG 
CAGAAGTGTC ACAAGAAGTT CGTGTCTGTC CGGGTCGTGG CTATATCCTT CG7GTCACAG 
CGTACAAGGA GGGATATGGA GAAGGTTGCG TAACCATTCA TGAGATCGAG AACAATACAG 3240 
ACGAACTGAA GTTTAGCAAC TGTGTAGAAG AGGAAGTATA TCCAAACAAC ACGGTAACGT 
GTAATGATTA TACTGCGACT CAAGAAGAAT ATGAGGGTAC GTACACTTCT CGTAATCGAG 
GATATGACGG AGCCTATGAA AGCAATTCTT CTGTACCAGC TGATTATGCA TCAGCCTA7G 
AAGAAAAAGC ATATACAGAT GGACGAAGAG ACAATCCTTG TGAATCTAAC AGAGGATATG 3480 
GGGATTACAC ACCACTACCA GCTGGCTATG TGACAAAAGA ATTAGAGTAC TTCCCAGAAA 3540 
CCGATAAGGT ATGGATTGAG ATCGGAGAAA CGGAAGGAAC ATTCATCGTG GACAGCGTGG 
AATTACTTCT TATGGAGGAA TAATATATGC TTTATAATGT AAGGTGTGCA AATAAAGAAT 
GATTACTGAC TTGTATTGAC AGATAAA7AA GGAAATTTTT ATATGAATAA AAAACGGGCA 
TCACTCTTAA AAGAATGATG TCCGTTTTTT GTATGATTTA ACGAGTGATA TTTAAATGTT 
TTTTTTGCGA AGGCTTTACT TAACGGGGTA CCGCCACATG CCCATCAACT TAAGAATTTG 
CACTACCCCC AAGTGTCAAA AAACGTTATT CTTTCTAAAA AGCTAGCTAG AAAGGATGAC 
ATTTTTTATG AATCTTTCAA TTCAAGATGA ATTACAACTA TTTTCTGAAG AGCTGTATCG 
TCATTTAACC CCTTCTCTTT TGGAAGAACT CGCTAAAGAA TTAGGTTTTG TAAAAAGAAA 
ACGAAAGTTT TCAGGAAATG AATTAGCTAC CATATGTATC TGGGGCAGTC AACGTACAGC 4080 
GAGTGATTCT CTCGTTCGAC TATGCAGTCA ATTACACGCC GCCACAGCAC TCTTATGAGT 4140 
CCAGAAGGAC TCAATAAACG CTTTGATAAA AAAGCGGTTG AAT7TTTGAA ATATATTTTT 
TCTGCATTAT GGAAAAGTAA ACTTTGTAAA ACATCAGCCA TTTCAAGTGC AGCACTCACG 
TATTTTCAAC GAATCCGTAT TTTAGATGCG ACGATTTTCC AAGTACCGAA ACATTTAGCA 
CATGTATATC CTGGGTCAGG TGGTTGTGCA CAAACTGCAG 
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(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pure maize optimized synthetic 3T Cry LA (b) 
gene 



(xi ) SEQUENCE DESCRIPTION : SEQ ID NO : 2 : 








ATGGACAACA 


ACCCCAACAT 


CAACGAGTGC 


ATCCCCTACA 








GTGGAGGTGC 




GCGCATCGAG 


ACCGGCTACA 




CATCAGCCTG 


120 


AGCCTGACCC 


AGTTCCTGCT 


GAGCGAGTTC 


GTGCCCGGCG 


CCGGCTTCGT 


GCTGGGCCTG 


180 


GTGGACATCA 


TCTGGGGCAT 


CTTCGGCCCC 


AGCCAGTGGG 


ACGCCTTCCT 


GGTGCAGATC 


240 


GAGCAGCTGA 


TCAACCAGCG 


CATCGAGGAG 


TTCGCCCGCA 


ACCAGGCCAT 


CAGCCGCCTG 


300 




GCAACCTGTA 


CCAGATCTAC 


GCCGAGAGCT 


TCCGCGAGTG 


GGAGGCCGAC 


360 






CGAGGAGATG 


CGCATCCAGT 


TCAACGACAT 


GAACAGCGCC 


420 


CTGACCACCG 


CCATCCCCCT 


GTTCGCCGTG 


CAGAACTACC 


AGGTGCCCCT 


GCTGAGCGTG 


480 


TACGTGCAGG 


CCGCCAACCT 


GCACCTGAGC 


GTGCTGCGCG 


ACGTGAGCGT 


GTTCGGCCAG 


540 


CGCTGGGGCT 


TCGACGCCSC 


CACCATCAAC 


AGCCGCTACA 


ACGACCTGAC 


CCGCCTGATC 


600 


GGCAACTACA 


CCGACCACGC 


CGTGCGCTGG 


TACAACACCG 


GCCTGGAGCG 


CSTGTGGGGC 


660 


CCCSACAGCC 


GCGACTGGAT 


CCGCTACAAC 


CAGTTCCGCC 


GCGAGCTGAC 


CCTGACCGTG 


720 


CTGGACATCG 


TGAGCCTGTT 


CCCCAACTAC 


GACAGCCGCA 


CCTACCCCAT 




780 


AGCCAGCTGA 


CCCSCGAGAT 


CTACACCAAC 


CCCGTGCTGG 


AGAACTTCGA 


CGGCAGCTTC 


840 


CGCGGCAGCG 


CCCAGGGCAT 


CGAGGGCAGC 


ATCCGCAGCC 


CCCACCTGAT 


GGACATCCTG 


900 


AACAGCATCA 


CCATCTACAC 


CGACGCCCAC 


CGCGGCGAGT 


ACTACTGGAG 


u uuL wAL 


960 
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ATCATGGCCA 
ATGGGCAACG 
ACCC7GAGCA 
AGCG7GC7GG 
TACCGCAAGA 
CCCCCCCGCC 
AGCAACAGCA 
GAG77CAACA 
AACC7GGGCA 
CGCCGCACCA 
CAGCGCTACC 
ATCGACGGCC 
C7GCAGAGCG 
AGCAGCGTGT 
CGCATCSAG7 
CAGAAGGCCG 
ACCGAC7ACC 
CTGGACGAGA 
CGCAACC7GC 
CGCGGCAGCA 
ACCCTGC7GG 
AGCAAGC7GA 
C7GGAGA7C7 
GGCAGCC7G7 
CAC77CAGCC 
G7GAICT7CA 
GAGGAGAAGC 



GCCCCG7GGG 
CCGCCCCCCA 
GCACCCTG7A 
ACGGCACGGA 
GCSGCACCG? 
AGGGC7TCAG 
GCG7GAGCA7 
ACATCATCCC 
GCGGCACC&G 
GCCCCGGCCA 
GCG7GCGCAT 
GCCCCA7CAA 
GCAGC7TCCG 
7CACCCTGAG 
TCGTGCCCGC 
7GAACGAGC7 
ACATCGACCA 
AGAAGGAGCT 
TGCAGGACCC 
CCGACATCAC 
GCACC77CGA 
AGGCCTACAC 
ACCTGATCCG 
GGCCCC7GAG 
TGGACATCGA 
AGATCAAGAC 
CCC7GG7GGG 



C77CAGCGGC 
GCAGCGCATC 
CCGCC3CCCC 
GTTCGCCTAC 
GGACAGCC7G 
CCACCGCCTG 
CA7CCGCGCC 
CAGCAGCCAG 
CGTGGTGAAG 
GATCAGCACC 
CCGC7ACGCC 
CCAGGGCAAC 
CACCGTGGGC 
CGCCCACGTG 
CGAGGTGACC 
GTTCACCAGC 
GGTGAGCAAC 
GAGCGAGAAG 
CAACTTCCGC 
CATCCAGGGC 
CGAG7GC7AC 
CCGCTACCAG 
CTACAACGCC 
CGCCCCCAGC 
CGTGGGCTGC 
CCAGGACGGC 
CSAGGCCC7G 



CCCGAG77CA 
GTGGCGCAGC 
TTCAACATCG 
GGCACCAGCA 
GACGAGA7CC 
AGCCACG7GA 
CCCATGTTCA 
A7CACCCAGA 

CTGCGCG7GA 
AGCACCACCA 
77CAGCGCCA 
T7CACCACCC 
TTCAACAGCG 
7TCGAGGCCG 
AGCAACCAGA 
C7GGTGGAG7 
G7GAAGCACG 
GGCA7CAACC 
GGCGACGACG 
CCCACC7ACC 
C7GCGCGGC7 
AAGCACSAGA 
CCCA7CGGCA 
ACCGACC7GA 
CACGCCCGCC 
GCCCGCG7GA 
166 



CC77CCCCC7 
TGGGCCAGGG 
GCA7CAACAA 
GCAACG7GCC 
CCCCCCAGAA 
GCA7G77CCG 
GC7GGATCCA 
TCCCCC7GAC 
TCACCGGCGG 
ACA7CACCGC 
ACC7GCAG7T 
CCA7GAGCAG 
CC7TCAAC77 
GCAACGAGGT 
AG7ACGACC7 
TCGGCC7GAA 
GCC7GAGCGA 
CCAAGCGCC7 
GCGAGC7GGA 
TG77CAAGGA 
7G7ACCAGAA 
ACA7CGAGGA 
CCG7GAACG7. 
AG7GCGCCCA 
ACGAGGACC7 
TGGGCAACC7 
AGCGCGCCGA 



G7ACGGCACC 1020 

CG7G7ACCGC 1080 

CCAGCAGCTG 1140 

CAGCGCCG7G 1200 

CAACAACGTG 1260^ 

CAGCGGC7TC 1320 

CCGCAGCGCC 1380 

CAAGAGCACC 1440 

CGACA7CC7G 1500 

CCCCC7GAGC 1560 

CCACACCAGC 1620 

CGGCAGCAAC 1680 

CAGCAACGGC 1740 

GTACATCGAC 1800 

GGAGCGCGCC 1860 

GACCGACGTG 1920 

CGAG77C7GC 1980 

GAGCGACGAG 2040 

CCGCGGCTGG 2100 

GAAC7ACG7G 2160 

GA7CGACGAG 2220 

CAGCCAGGAC 2280 

GCCCGGCACC 2340 
CCACAGCCAC 240a 
GGGCG7G7GG 2460 
GGAG77CC7G 2520 
GAAGAAGTGG 2580 
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CGCGACAAGC 


GCGAGAAGCT 


GGAGTGGGAG 


ACCAACATCG 


TGTACAAGGA 


GGCCAAGGAG 


2640 


AGCGTGGACG 


CCCTG7TCGT 


GAACAGCGAG 


?ACGACCSCC 


7GCAGGCCGA 


CACCAACATC 


2700 


GCCATGATCC 


*m &m *m *m *m +m m 


CAAGCGCGTG 


CACAGCATCC 


GCGAGGCCTA 


CC7GCCCGAG 


2760 


CTGAGv-GTGA 




GAACGCCGCC 


ATCTTCGAGG 


AGC7GGAGGG 


CCGCATC7TC 


2820 




GwCToTACGA 


CGCCCGCAAC 


GTGATCAAGA 


ACGGCGAC77 


CAACAACGGC 


2880 


CTGAGCTGC? 


GGAACGTGAA 


GGGCCACGTG 


GACGTGGAGG 


AGCAGAACAA 


CCACCGCAGC 


2940 


GTGCTGvaTGG 


7GCCCGAGTG 


GGAGGCCGAG 


GTGAGCCAGG 


AGGTGCGCGT 


GTGCCCCGGC 


3000 


CGCGGCTACA 


TCCTGCGCGT 


GACCGCCTAC 


AAGGAGGGCT 


ACGGCGAGGG 


CTGCGTGACC 


3060 


ATCCACGAGA 


TCGAGAACAA 


CACCGACGAG 


CTGAAGTTCA 


GCAAC76CGT 


GGAGGAGGAG 


3120 


GTGTACCi-wA 


ACAACACCG7 


GACCTGCAAC 


GACTACACCG 


CCACCCAGGA 


GGAG7ACGAG 


3180 


GuCAC wTACA 


CCAGuCGCAA 


CCGCGGC7AC 


GACGGCGCCT 


ACGAGAGCAA 


CAGCAGCG7G 


3240 


CCCGCC3AC7 


ACGCCAGCGC 


C7ACGAGGAG 


AAGGCCTACA 


CCGACGGCCG 


CCGCSACAAC 


2300 


CCCTGCGAGA 


GCAACCGCGG 


C7ACGGCGAC 


TACACCCCCC 


7GCCCGCCGG 


C7ACG7GACC 


3360 


AAGGAGCTGG 


AGTACTTCCC 


CGAGACCGAC 


AAGG7GTGGA 


TCGAGATCGG 


CGAGACCGAG 


3420 


GGCACGT7CA 


TCGTGGACAG 


CGTGGAGC7G 


C7GC7GA7GG 


AGGAGTAGTA 


CA7G 


3474 



(2) INF0RMA7I0N FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Truncated synthetic maize optimized 37 
Cry I A (b) gene 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GA7CCAACAA 7GGACAACAA CCCCAACATC AACGAG7GCA TCCCCTACAA C7GCC7GAGC 60 
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AACCCCGAGG TGGAGGTGC7 GGGCGGCGAG CGCATCGAGA CCGGC7ACAC CCGCATCGAC 
ATCAGCCTGA GCCTGACCCA GTTCCTGCTG AGCGAGTTCG TGCCCGGCGC CGGCT7CGTG 
CTGGGCCTGG TGGACATCAT CTGGGGCATC TTCGGCGCCA GCCAGTGGGA CGCCTTCCTG 
GTGCAGATCG AGCAGCTGAX CAACCAGCGC ATCGAGGAGT TCGCCCGCAA CCAGGCCATC 
AGCCGCCTGG AGGGCCTGAG CAACCTGTAC CAAATC7ACG CCGAGAGC7T CCGCGAGTGG 360. 
GAGGCCGACC CCACCAACCC CGCCCTGCGC GAGGAGATGC GCATCCAG7T CAACGACATG 420 
AACAGCGCCC TGACCACCGC CATCCCCCTG TTCGCCGTGC AGAACTACCA GGTGCGCCTG 480 
CTGAGCGTGT ACGTGCAGGC CGCCAACCTG CACCTGAGCG TGCTGCGCGA CGTCAGCGTG 540 
TTCGGCCAGC GCTGGGGCTT CGACGCCGCC ACCATCAACA GCCGC7ACAA CGACCTGACC 
CGCCTGATCG GCAACTACAC CGACCACGCC GTGCGCTGGT ACAACACCGG CCTGGAGCGC 
GTGTGGGGTC CCGACAGCCG CGACTGGATC AGGTACAACC AGTTCCSCCG CGAGCTGACC 
CTGACCGTGC TGGACATCG7 GAGCCTG7TC CCCAACTACG ACAGCCGCAC C7ACCCCATC 780 
CGCACCGTGA GCCAGCTGAC CCGCGAGATT TACACGAACC CCGTGCTGGA GAACTTCGAC 
GGCAGCTTCC GCGGCAGCGC CCAGGGCATC GAGGGCAGCA TCCGCAGCCC CCACCTGATG 
GACATCCTGA ACAGCATCAC CATCTACACC GACGCCCACC GCGGCGAGTA CTACTGGAGC 960 
GGCCACCAGA TCATGGCCAG CCCCGTCGGC TTCAGCGGCC CCGAG7TCAC CTTCCCCCTG 1020 
TACGGCACCA TGGGCAACGC TGCACCTCAG CAGCGCATCG TGGCACAGCT GGGCCAGGGA 1080 
G7GTACCGCA CCC7GAGCAG CACCCTGTAC CG7CGACC7T TCAACATCGG CATCAACAAC 1140 
CAGCAGCTGA GCGTGCTGGA CGGCACCGAG TTCGCCTACG GCACCAGCAG CAACCTGCCC 1200 
AGCGCCGTGT ACCGCAAGAG CGGCACCGTG GACAGCC7GG ACGAGATCCC CCCTCAGAAC 1260 
AACAACGTGC CACCTCGACA GGGCTTCAGC CACCGTCTGA GCCACGTGAG CATGTTCCGC 1320 
AGTGGCT5CA GCAACAGCAG CGTGAGCATC ATCCGTGCAC CTATGTTCAG CTGGATTCAC 1380 
CGCAGTGCCG AGTTCAACAA CATCATCCCC AGCAGCCAGA TCACCCAGAT CCCCCTGACC 1440 
AAGAGCACCA ACCTGGGCAG CGGCACCAGC GTGGTGAAGG GCCCGGGCTT CACGGGCGGC 1500 
GACATCCTGC GCCGCACCAG CCCCGGCCAG ATCAGCACCC TGCGCGTGAA CATCACCGCC 1560 
CCCCTGAGCC AGCGCTACCG CGTCCGCATC CGCTACGCCA GCACCACCAA CCTGCAGTTC 1620 
CACACCAGCA TCGACGGCCG CCCCATCAAC CAGGGCAACT TCAGCGCCAC CATGAGCAGC 1680 

153 



600 
660 
720 



340 
900 



WO 93/07278 PCT/US92/08476 

GGCAGCAACC TGCAGAGCGG CAGCTTCCGC ACCGTGGGCT TCACCACCCC C7TCAAC7TC 1740 

AGCAACGGCA GCAGCGTG7T CACCC7GAGC GCCCACGTGT TCAACAGCGG CAACGAGGTG 1800 

TACATCGACC GCATCGAGTT CGTGCCCGCC GAGGTGACCT TCGAGGCCGA GTACGACCTG 1860 

GAGAGGGCTC AGAAGGCCGT GAACGAGCTG TTCACCAGCA GCAACCAGAT CGGCC7GAAG 1920 

ACCGACG7GA CCGACTACCA CATCGATCAG GTGTAGGAGC 7 1961 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3508 base pairs 

(B) TYPE: nucleic acid 
(C5 STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Full length synthetic maize optimized 3T 
CrylA(b) gene 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GATCCAACAA TGGACAACAA CCCCAACATC AACGAGTGCA TCCCCTACAA CTGCCTGAGC 60 

AACCCCGAGG TGGAGGTGCT GGGCGGCGAG CGCATC3AGA CCGGCTACAC CCCCATCGAC 120 

ATCAGCCTGA GCC7GACCCA GTTCCTGCTG AGCGAGTTCG TGCCCGGCGC CGGCTTCGTG 180 

CTGGGCCTGG TGGACATCAT CTGGGGCATC TTCGGCCCCA GCCAGTGGGA CGCCTTCCTG 240 

GTGCAGATCG AGCAGCTGAT CAACCAGCGC ATCGAGGAGT TCGCCCGCAA CCAGGCCATC 300 

AGCCGCCTGG AGGGCCTGAG CAACCTGTAC CAAATCTACG CCGAGAGCTT CCGCGAGTGG 360 

GAGGCCGACC CCACCAACCC CGCCCTGCGC GAGGAGATGC GCATCCAGTT CAACGACATG 420 

AACAGCGCCC TGACCACCGC CATCCCCCTG TTCGCCGTGC AGAACTACCA GGTGCCCCTG 480 

CTGAGCGTGT ACGTGCAGGC CGCCAACCTG CACCTGAGCG TGCTGCGCGA CGTCAGCGTG 540 

TTCGGCCAGC GCTGGGGCTT CGACGCCGCC ACCATCAACA GCCGCTACAA CGACCTGACC 600 

CGCCTGATCG GCAACTACAC CGACCACGCC GTGCGCTGGT ACAACACCGG CCTGGAGCGC 660 

GTGTGGGGTC CCGACAGCCG CGACTGGATC AGGTACAACC AGTTCCGCCG CGAGCTGACC 720 
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C7GACCG7GC TGGACATCGT GAGCC7G77C CCCAAC7AC3 ACAGCCGCAC C7ACCCCA7C 
CSOCCS»» GCCAGC7GAC CCGCGAGATT TACACCAACC CCG7GC7GGA GAACTTCGAC 
GGCAGC77CC GCGGCAGCGC CCAGGGCA7C GAGGGCAGCA TCCGCAGCCC CCACCTGATG 
GACA7CC7GA ACAGCA7CAC CA7C7ACACC GACGCCCACC GCGGCGAG7A C7AC7GGAGC 
GGCCACCAGA TCATGGCCAG CCCCGTCGGC TTCAGCGGCC CCGAGTTCAC C77CCCCC7G 
TACGGCACCA 7GGGCAACGC TGCACGTCAG CAGCGCATCG 7GGCACAGC7 GGGCCAGGGA 
GTGTACCGCA CCCTGAGCAG CACCCTGTAC CGTCGACCTT 7CAACA7CGG CATCAACAAC 
CAGCAGCTGA GCG7GCTGGA CGGCACCGAG T7CGCCTACG GCACCAGCAG CAACC7GCCC 
AGCGCCG7G7 ACCGCAAGAG CGGCACCG7G GACAGCC7GG ACGAGA7CCC CCC7CAGAAC 
AACAACG7GC CACC7CGACA GGGC77CAGC CACCG7C7GA GCCACG7GAG CA7G77CCGC 
AG7GGC77CA GCAACAGCAG CG7GAGCA7C A7CCG7GCAC C7A7G77CAG C7GGA77CAC 
CGCAG7GCCG AG77CAACAA CA7CA7CCCC AGCAGCCAGA 7CACCCAGA7 CCCCC7GACC 
AAGAGCACCA ACC7GGGCAG CGGCACCAGC G7GG7GAAGG GCCCCGGC7T CACCGGCGGC 
GACATCC7GC GCCGCACCAG CCCCGGCCAG A7CAGCACCC 7GCGCG7GAA CA7CACCGCC 
CCCC7GAGCC AGCGC7ACCG CG7CCGCA7C CGC7ACGCCA GCACCACCAA CC7GCAG7TC 
CACACCAGCA 7CGACGGCCG CCCCA7CAAC CAGGGCAAC7 7CAGCGCCAC CA7GAGCAGC 
GGCAGCAACC 7GCAGAGCGG CAGC77CCGC ACCG7GGGC7 7CACCACCCC C77CAAC77C 
AGCAACGGCA GCAGCG7G77 CACCC7GAGC GCCCACG7G7 7CAACAGCGG CAACGAGG7G 
TACATCGACC GCATCGAG77 CS7GCCCGCC GAGG7GACC7 7CGAGGCCGA G7ACGACC7G 
GAGAGGGC7C AGAAGGCCG7 GAACGAGC7G 77CACCAGCA GCAACCAGA7 CGGCCTGAAG 
ACCGACG7GA CCGAC7ACCA CA7CGA7CAG G7GAGCAACC 7GGTGGAG7G CC7GAGCGAC 
GAG77C7GCC 7GGACGAGAA GAAGGAGC7G AGCGAGAAGG 7GAAGCACGC CAAGCGCC7G 
AGCGACGAGC GCAACC7GC7 GCAGGACCCC AAC77CCGCG GCA7CAACCG CCAGC7GGAC 
CGCGGC7GGC GCGGCAGCAC CGACA7CACC A7CCAGGGCG GCGACGACG7 G77CAAGGAG 
AAC7ACG7GA CCC7GC7GGG CACC77CGAC GAG7GC7ACC CCACC7ACC7 G7ACCAGAAG 
A7CGACGAGA GCAAGC7GAA GGCC7ACACC CGC7ACCAGC TGCGCGGC7A CA7CGAGGAC 
AGCCAGGACC 7GGAGA7C7A CC7GA7CCGC 7ACAACGCCA AGCACGAGAC CG7GAACG7G 
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ACCAACATCG 


CCATGATCCA 
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AAGCGGGTGP 
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nwluwil • wo 


f ^ A (ZfT m ^ r* 
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w X wwww wAww 


TGAGCGTGAT 


CCCC5GCSTG 

WWWWW^JW^W X w 
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AAwwwWsJwwA 


i wX XwOAUwA 


ww x ww^iwwOw 
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CGCATCTTCA 


CCGCCTTCAG 


ww * W XAWdA^ 
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wkJwwwXlw ^ TC 
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AACAACGGCC 


TGAGCTGCTG 


GAACGTGAAG 


ggp^"* a {■""■"!•!* ft.* - : 

UUUwACvJ X UV3 


Aww x bbhbuh 


VjwaV3aAUaaC 
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GAGGf~r*tAf":f"* 

UAUOL w OliUU 




CO 1 UwwwO X \J 


^ rt ft rt 
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mf+ ^++0+0*0*0+0*0* 


GCGGC7ACA7 


CC7GCSCSTG 

WW — i ww ww w X w 


AwwtJww ^itwi 


Aw oAtaiio w x A 


ww-wwoAUOww 


^ ft ^ n 

3060 


TGC3TGACCA 


7CCACGAGAT 


CGAGAACAAC 


ACCGACGAGC 


TCAAGTTCAG 


CaAC x GwG T G 
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GAGGAGGAGG 


TG7ACCCCAA 


CAACAP£*GTG 

WAAwAW WW X O 


ACCTGCAACG 


AC7ACACCGC 


CAC C wAG oAG 


^ ^ a ft 

3180 


GAGTACGAGG 


GCACCTACAC 


CAGCCGCAAC 


CGCSGC7ACG 


ACGGCGCCTA 


u wA wACs CAAC 


^ft A ft 
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AGCAGCG7GC 


CCGCCGACTA 


WwwwAwwwww 


TACGAGGAGA AGGCCTACAC 


f+0*^, f 0* r+ *+ *m 
'^.wXlwOOwwOw 


^ ft ft 
32C0 


CGCGACAACC 


^ w ^ WwyflyAw 


V-O^XTlWW WW 


TACGGCGACT 


AwAww>Wwww * 


f»0+ 0m 0+0+ 0m 0m 0*++ 0m 
V^WWWOwwOAJIw 


3360 


TACGTGACCA 


AGGAGCTGGA 


G7AC7TCCCC 


GAGACCGACA 


AGGTGTGGAT 


CGAGATCGGC 


3420 


GAGACCGAGG 


GCACCTTCAT 


CG7G6ACAGC 


GTGGAGCTGC 


TGCTGATGGA 


GGAGTAGTAC 


3480 


ATGTGATAGT 


ACGTAAGCTC 


GAGGATC? 
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(2) INFORMATION FOR SEQ ID NO:5: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 184S base pairs 

(B) TYPE: nucleic acid* 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Truncated syntiienic 3T gene (Periak ec ai.) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGGACAACA ACCCAAACAT CAACGAATGC ATTCCATACA ACTGCTTGAG 7AACCCAGAA 
GTTGAAGTAC TTGGTGGAGA ACGCATTGAA ACCGGTTACA CTCCCATCGA CATCTCCTTG 
TCCTTGACAC AGTTTCTGC7 CAGCGAGTTC G7GCCAGGTG CTGGGTTCGT TCTCGGACTA 
GTTGACATCA TCTGGGGTAT C7TTGGTCCA TCTCAATGGG ATGCATTCCT GGTGCAAATT 
GAGCAGTTGA TCAACCAGAG GATCGAAGAG TTCGCCAGGA ACCAGGCCAT CTCTAGGTTG 
GAAGGATTGA GCAATCTCTA CCAAATCTAT GCAGAGAGCT TCAGAGAGTG GGAAGCCGAT 
CCTACTAACC CAGCTCTCCG CGAGGAAATG CGTATTCAAT TCAACGACAT GAACAGCGCC 
TTGACCACAG CTATCCCATT GTTCGCAGTC CAGAACTACC AAGTTCCTCT CTTGTCCGTG 
TACGTTCAAG CAGCTAAICT TCACCTCAGC GTGCTTCGAG ACGTTAGCGT GTTTGGGCAA 
AGGTGGGGAT TCGATGCTGC AACCATCAAT AGCCGTTACA ACGACCTTAC TAGGCTGATT 
GGAAACTACA CCGACCACGC 7G7TCGTTGG TACAACACTG GCTTGGAGCG TGTCTGGGGT 
CCTGATTCTA GAGATTGGAT TAGATACAAC CAGTTCAGGA GAGAATTGAC CCTCACAGTT 
TTGGACATTG TG7CTCTCTT CCCGAACTAT GAC7CCAGAA CCTACCC7AT CCGTACAGTG 
TCCCAACTTA CCAGAGAAAT C7ATACTAAC CCAGTTCTTG AGAACTTCGA CGGTAGC77C 
CGTGGTTCTG CCCAAGGTAT CGAAGGC7CC A7CAGGAGCC CACACTTGAT GGACATCT7G 
AACAGCATAA C7ATCTACAG CGATGC7CAC AGAGGAGAGT ATTACTGGTC TGGACACCAG 
AXCATGGCCT CTCCAGTTGG ATTCAGCGGG CCCGAG7TTA CCTT7CCTC7 CTATGGAACT 
ATGGGAAACG C-GC7CCACA ACAACGTATC GTTGCTCAAC TAGG7CAGGG TGTCTACAGA 
ACCTTGTCTT CCACC7TGTA CAGAAGACCZ TTCAATATCG GTATCAACAA CCAGCAAC77 
TCCGTTCTTG ACGGAACAGA GTTCGCCTAT GGAACCTCT? CTAACTTGCC ATCCGCTGT? 
TACAGAAAGA GCGGAACCGT TGATTCCTTG GACGAAATCC CACCACAGAA CAACAATGTG 
CCACCCAGGC AAGGATTCTC CCACAGGTTG AGCCAC3TGT CCATGTTCCG TTCCGGATTC 
AGCAACAGTT CCGTGAGCAT CATCAGAGCT CCTATGTTCT CATGGATTCA TCGTAGTGCT 1380. 
GAGTTCAACA ATATCATTCC TTCCTCTCAA ATCACCCAAA TCCCATTGAC CAAGTCTACT 
AACCTTGGA? CTGGAACTTC TGTCGTGAAA GGACCAGGCT TCACAGGAGG TGATATTCTT 
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AGAAGAACTT CTCCTGGCCA GATTAGCACC 
CAAAGATATC GTGTCAGGAT TCG7TACGCA 
ATCGACGGAA GGCCTATCAA TCAGGGTAAC 
TTGCAATCCG GCAGC7TCAG AACCGTCGGT 
TCAAGCGTTT TCACCC7TAG CGCTCATGTG 
CGTATTGAGT TTGTGCCTGC CGAAGTTACC 



CTCAGAGTTA ACATCACTGC ACCACTTTCT 
TCTACCACTA ACTTGCAATT CCACACCTCC 
TTCTCCGCAA CCATGTCAAG CGGCAGCAAC 
TTCACTACTC CTTTCAACTT CTCTAACGGA 
TTCAATTCTG GCAATGAAGT GTACATTGAC 
TTCGAGGCTG AGTAC 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRnicI36£r) 



A. The indications made beiow relate to the nueroor^nism referred to in the descnpuon 
on page 159 , lines 21-25 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet [""") 



Name of depositary institution 

Agricultural Research Culture Collection (HHEL) 



Peoria r XL 61604 




U.S.A. 




Date of deposit 


Accession Number 


4 October 1991 (04.10.91) 


5RRL B 18891 



Address of depositary msritution (iackalngposiai code and country) 
1815 Horth University Street 



C ADDITIONAL INDICATIONS (leavebknkif not applicable) This information is continued on an additional sheet Q 
We request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (sfihcbidko^mnu^aUiai^mtdSta^ 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if net applicable) 



Tbe indications listed beiow will besubmitxed to the International Bureau izter (specify the gatertUnaturcofthei 
Number of Deposit*) 



'inaicnti<vae.g^ 'Accession 



For receiving Office use only 



I 1 This sheet was received with the international application 



Authorized officer 



For International Bureau use only 



| | This sheet was received by the Interna bona! Bureau on: 



Authorized offic 



Form PCI7RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRule llbis) 



A. The indications made below relate to the microorganism referred to in the description 
on page 159 .lines 21-25 


B* IDENTIFICATION OF DEPOSIT 




Further deposits are identified on an additional sheet 


□ 


Name of depositary institution 








Agricultural Research Culture Collection (HBRL) 




Address of depositary institution ftnelaalng postal code end country) 




1815 North University Street 

Peoria, XL 61604 

U.S.A. 








Date of deposit 

4 October 1991 (04.10.91) 


Accession Number 
BRRL B 18892 


C. ADDITIONAL INDICATIONS (leave blamk if m* applkaUe) This iofbnaatioa is continued on an additional sheet 


□ 


We request the Expert Solution where 


available 




D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the udbaion are net far ell Jmpuui State) ' 




E. SEPARATE FURNISHING OF INDICATIONS (In 


ve 


blankif not applicable) 




n£ ^JSSSrT* beJOWW1 " bcsabnune<1 10 *** »««na««oiMl Bvmau ^{spcdfrtega^naUMoflheindicauaae.^ 'Acetate* 


1 1 "Hiis sheet was received with the iniernational application 




|~| This sheet was received by the wternabonal Bureau on: 


Authorized officer 




Authorized officer 



Form PCI7RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc I3bis) 

A, The indications nude below relate to the microorganism referred to in the description 
on page , TSQ , lines 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet | | 
Name of depositary institution 

Agricultural Research Culture Collect-inn (HHHL) 

Address of depositary institution (iartaalnx postal coaeajtf country) 

1815 Hdrth University Street 

Peoria, H. 61604 

TJ.S.A* 



Date of deposit 


Accession Number 


3 September 1992 (03.09.92) 


HRRL B 18994 



C ADDITIONAL INDICATIONS (lemvblank if not applicable) Tnis information is contimied on an additional sheet Q 



We request the * Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE fiftheuialcoaonsarenotforali aenptated Slates) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



Themdicatians listed beiowwill besubinittedto the international Bureau later (specify the^eneraittamreoftheiaaicauome.^ % 
Number of Deposit") 



For receiving Office use only 



1 1 Tnis sheet was received with the international ap pl ic ati on 



Authorized offic 



For International Bureau use only 



This sheet was received by the International Bureau c 



Authorized officer 



Foim PCI7RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuic 13bis) 



| A ' ^ indications nude beiow relate to the microorganism referred to in the description 
o«!»Sc \$9 m , lines 21-25 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet [~ 



Name of deposiury institution 

Agricultural Research Culture Collection (KRKL) 
Address of depositary institution Cu^iuding posui code *md country) 



1815 Hortfa University Street 

Peoria, XL 61604 

U.S.A. 







Accession Number 


3 September 1992 (03.09.92) 




HRRL B 18995 


j C ADDITIONAL INDICATIONS (leave blank if M 


xappfcable) This information is continued on an •A**«i^ rn \ *h~t [""] 



»b request the Expert Solution where available 



P. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the iadicatiaa an not for ait imputed States) 



E. SEPARATE FURNISHING OF INDICATIONS (W blank if net applicable) 



For receiving Office use only 



D w « received with the international application 



Authorized officer 



Form PCT/RO/134 (July 1992) 



For Interna tionai Bureau use only 



Q This sheet was received by the International Bureau 



on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulc I2bis) 



A. The indications made below relate to the microorganism referred to in the description 
onpage 159 .lines 21^25. 



B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet □ 



Name of depositary institution 

AgcLcultnral Research Culture Collection (HEEL) 



Address of depositary institution (including postal code and country) 

1815 north University Street 

Peoria, 3X 61604 

U.S.A- 



Date of deposit 

4 September 1992 (04.09.92) 


Accession Number 
SHKL B 18996 


C ADDITIONAL INDICATIONS (leave bloat if a* applicable) This information is continued on an additional sheet Q 



We request the ■ » Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE <?/ the indications arc act for all desirnated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blonkif net applicable) 



Thetndicau'onsiistedbeiow will be submitted to the international Bureau later (specify thegencrai nature of the imiirtttinns c*&, *; 
Number of Deposit') 



For receiving Office use only 



| | This sheet was received with the intemationai application 



Authorized officer 



Bar interna tianaJ Bureau use only 



Q Ihis sheet was received by the International Bureau am 



Authorized officer 



Form PCT7RO/I34 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuic 13bis) 



A. "The indications made bciow relate to ihc microorganism referred to in the description 
on page 159 jj neS 21-25 


B. IDENTIFICATION OF DEPOSIT 


Further deposits are identified on an additional sheet [ j 


Name oi deposiury institution 




Agricultural Research Culture Collection (HRHL) 




1815 Hortfa University Street 

Beoria r XL 61604 

U.S.A. 




Date of deposit 

21 September 1992 (21.09.92) 


Accession Number 
HRHL 3 18997 



C ADDITIONAL INDICATIONS (leave blank if ex* appacabk) Tais infonnation is continued on an additional sheet Q 



request the Expert Solution vhere available 



a DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE fifth* 



ore net for aU deagmani St tries) 



B. SEPARATE FURNISHIN G OF INDICATIONS (leave bhmk if not appScabie) 
N^oTd^ 1 ^ WOW W,i,bcSUbmincd 10 ^«na«onai Bureau Uter*^^^,^^] 



'Ac 



Foriecerving Office use only 



C3 sheet was received with the international appii. 



application 



Authorized officer 



Form PCT/RO/134 (July 1992) 



For International Bureau use oniy 



IZ3 sheet was received by the Iittrarional Bureau 



on: 



Authorized officer 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRoic Ubis) 



A. The indications made below relate to the microorganism referred to in the description 
159 .lines 21-25 



on page 



B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet [""*] 



Name of depositary institution 

Agricultural Research Culture Collection (HRRL) 



Address of depositary institution (indu&tx postal code, aid cowry) 

1815 Hdrth University Street 

Peoria, XL 61604 

TJ-S-A- 



Date of deposit 

21 September 1992 (21.09.92) 



Accession Number 
HRRL 3 18998 



C ADDITIONAL INDICATIONS (leave bksskif mot appBcable) This information is continued on an additional sheet Q 



We request the 



Solution wnere available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE fifth* Miea t rnitr arenetfarali dat purtrtl Stain} 



£. SEPARATE FURNISHING- OF INDICATIONS (leave blaakif not applicable) 



Tne indications listed beiow will be submitted to the internauonai Bureau i2ter(spedfythegauraiiuiureafthe^ 'Accession 

/timber of Deposit") 



For receiving Office use only 



1""] I^^^^^reoervedwh^ tbeinteinaticaia] appiication 



Authorized officer 



For InternationaJ Bureau use only 



| [ Tnts sheet was received by tne Internatianal Bureau on: 



Amhanzed oulcer 



Form PCI7RO/134 (July 1992) 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRuie I3bis) 

A The indications made beiow relate to the microorganism referred to in tbe description 
on page 159 jhies 21-25 ' 

B. IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet Q 

Name of depositary institution 

Agricultural Research Culture mt^^rt lrm (HRKL) 

Address of depositary institution (tadadwg postal code and country) 
1815 Jtarth University Street 



Peoria, IL 61604 




U.S.A. 






Accession Number 


21 September 1992 (21.09.92) 


HRBZ* B 18999 



a ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additiooai sheet Q 



We request the Expert Solution where available 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (ifthe iruika^ 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



^^^^^ b ^ wiU ^^oouttcd to the international Bureau iztaispa^tke^ateraliunurecfihc 



uiauattom&g^ 'Accession 



For receiving Office use onJy 



D sac* was received with the international applicatioi 



Authorized officer 



Far interna tiouaJ Bureau use oniy 



I [ This sheet received by tbe International Bureau 



OK 



Authorized officer 



Fonn POVRO/134 (Juiy 1992) 
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What is claimed is: 

1. A nucleotide sequence comprising a maize optimized 
coding sequence which encodes a protein capable of killing an 
insect. 

2. A nucleotide sequence of claim 1, wherein said 
coding sequence encodes a 3 .t ■ protein. 

3. A nucleotide sequence of claim 2, wherein said 
coding sequence encodes a CrylA(b) protein. 

4. A nucleotide sequence of claim 3, wherein said 
coding sequence comprises Sequence 3, Sequence 4, or the 
sequence set forth in Fig. 7. 

5. A nucleotide sequence of claim 3, wherein coding 
sequence encodes a CrylA(b) protein which is heat stable 
compared to a native CrylA(b) protein. 

S. A nucleotide sequence of claim 5, wherein said 
coding sequence comprises a sequence selected from the group 
of sequences consisting of Fig. 9, Fig. 11, Fig. 13 and Fig. 15. 

7. A nucleotide sequence of claim 2, wherein said 
coding sequence encodes a Cry IB or a CrylA(c) protein. 

8* A nucleotide sequence of claim 7, wherein said 
coding sequence comprises the sequence encoding a CrylB protein 
set forth in Fig. 6. 

9. A nucleotide sequence of claim 1/ further comprising 
a first promoter capable of directing expression of a nucleotide 
sequence in a plant cell, operably linked to said coding 
sequence . 

10. A nucleotide sequence of claim 9, wherein said 
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promoter is capable of directing expression of the associated 
coding sequence in a maize ceil. 

11. A nucleotide sequence of claim 9, wherein said 
promoter is selected from the group consisting of inducible 
promoters f constitutive promoters, temporal or 
developmentally-regulated promoters, tissue-preferred, and 
tissue-specific promoters. 

12. A nucleotide sequence of claim 9, wherein said 
promoter is selected from the group consisting of a CaMV 3SS 
promoter, CaMV 19S promoter, a PEP carboxylase promoter, a 
pith-preferred promoter, and a pollen-specific promoter. 

13. A nucleotide sequence of claim 9, wherein said 
promoter is a pith-preferred promoter comprising the DNA 
sequence set forth in Fig. 24. 

14. A nucleotide sequence of claim 9, wherein said 
promoter is a pollen-specific promoter comprising the DNA 
sequence set forth in Fig. 35. 

15. A nucleotide sequence of claim 9, further 
comprising a second promoter capable of directing expression of 
an associated coding sequence in a plant cell, operatively 
linked to a second coding sequence. 

17. A nucleotide sequence of claim 15, wherein said 
second promoter is selected from the group consisting of 
inducible promoters, constitutive promoters, temporal or 
developmentally-regulated promoters, tissue-preferred, and 
tissue-specific promoters. 

18. A nucleotide sequence of claim 15, wherein said 
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second promoter is selected from the group consisting of a CaMV 
35S promoter, CaK* 19S promoter, a PEP carboxylase promoter, a 
pith-preferred promoter, and a pollen- specific promoter. 

21. A nucleotide sequence of claim 15, wherein said 
second coding sequence is a plant optimized coding sequence 
which encodes a protein capable of killing an insect. 

22. A nucleotide sequence of claim 21, wherein said 
second coding sequence encodes a 3.t. protein. 

23. A nucleotide sequence of claim 22, wherein said 
second coding sequence encodes a CryIA<b) protein. 

29. A nucleotide sequence of claim 15, wherein said 
second coding sequence is a marker gene. 

30. A recombinant vector, comprising a nucleotide 

sequence of claim 9. 

31. A recombinant vector, comprising a nucleotide 
sequence of claim 9, wherein said coding sequence encodes a 
protein. 

32. A recombinant vector of claim 31, wherein the 
protein is a CrylA(b) protein. 

33. A recombinant vector, comprising a nucleotide 

sequence of claim 15. 

34. A recombinant vector, comprising a nucleotide 

sequence of claim 18. 

35. A recombinant vector, comprising a nucleotide 

sequence of claim 22. 

36. A plant stably transformed with a nucleotide 

s quence of claim 9 . 
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37 . A plant stably transformed with a nucleotide 
sequence of claim 12. 

38. A plant of claim 36 f wherein said coding sequence 
encodes a B . t . protein. 

39. A plant of claim 36, wherein said protein is 
expressed in said plant in an amount sufficient to control 
Lepidopteran or Coleopteran insects. 

40. A plant of claim 38 , wherein the B.t. protein is a 
CrylA(b) protein. 

41. A plant of claim 38, which expresses the 3.t. 
insecticidal protein in an amount sufficient to control 
Lepidopteran or Coleopteran pests. 

42. A plant of claim 38, wherein the amount is 
sufficient to control insects selected from the group consisting 
of European com borer , Sugarcane borer , stalk borers , cutworms , 
army worms, rootworms, wireworms and aphids. 

43. A plant stably transformed with a nucleotide 
sequence of claim 15. 

44. A plant stably transformed with a nucleotide 
sequence of claim 18. 

45. A plant of claim 43 , wherein the first and second 
coding sequences each encode a B.t. protein. 

46. A plant of claim 38, which expresses the B.t , 
insecticidal proteins in an amount sufficient to control 
Lepidopteran or Coleopteran pests. 

47. A plant of claim 46, wherein the amount is 
sufficient to control stalk borers. 
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48. An isolated and purified promoter capable of 
directing pith-pr ferred expression of an associated structural 

gene in a plant. 

49. A promoter of claim 48 , isolated from a monocot. 

50. A promoter of claim 48, isolated from a maize 

plant . 

51. A promoter of claim 48, isolated from a plant 
tryptophan synthase-alpha (TrpA) subunit gene. 

52. A promoter of claim 51, isolated from a maize 
tryptophan synthase-alpha (TrpA) subunit gene. 

53. A promoter of claim 48, comprising the sequence set 

forth in Figure 24 . 

54 . A recombinant DNA molecule comprising a promoter of 
claim 48, operably associated with a structural gene encoding a 

protein of interest. 

55. A recombinant DNA molecule of claim 54, wherein 
said structural gene encodes an insecticidal protein. 

5S. A recombinant DNA molecule of claim 55, wherein 
said structural gene encodes a Bacillus 1-hiirinaiensis protein. 

57. A vector, comprising a recombinant DNA molecule of 
claim 54 . 

58. A vector of claim 57, wherein said structural gene 
encodes an insecticidal protein. 

59. A vector of claim 57, wherein said structural gene 
encodes a Bacillus thurinaiensis protein.- 

SO . A vector, comprising at least two recombinant DNA 
molecules of claim 54, wherein at least one of the two 
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structural genes encodes an insecticidal protein. 

61. A plant stably transformed with recombinant DNA 
molecule of claim 54 . 

62. a plant of claim 61, which is . a maize plant. 

63. A purified promoter capable of directing 
pollen-specific expression of an associated structural gene in a 
plant, wherein said promoter is isolated from a plant 
calcium-dependent phosphate kinase (CDPK) gene. 

64. A promoter of claim 63, isolated from a monocot CDPK 



gene. 



gene 



65. A promoter of claim 63, isolated from a maize CDPK 



66. A promoter of claim 65, comprising the sequence set 
forth in Figure 35. 

67. A recombinant DNA molecule, comprising a promoter 
of claim 63, operably associated with a structural gene encoding 
a protein of interest. 

68. A recombinant DNA molecule, of claim 67, wherein 
said structural gene encodes an ' insecticidal protein. 

69. A recombinant DNA molecule of claim 63, wherein 
said structural gene encodes a Bacillus thurinaiensis protein. 

70. A vector, comprising at least one recombinant DNA 
molecule of claim 67. 

71. A vector of claim 70, wherein said structural gene 
encodes an insecticidal protein. 

72. A vector of claim 71, wherein said structural gene 
encodes a Bacillus thurinaiensis protein. 
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73. A vector of claim 70, comprising two recombinant 
DNA molecules, wherein at least one of the two structural genes 
encodes an insecticidal protein. 

74 . A plant stably transformed with; a recombinant DNA 

molecule of claim 67 . 

75. A plant of claim 74, which is maize plant. 

76. A maize plant stably transformed with at least one 

recombinant DNA molecule; 

wherein said DNA molecule comprises a promoter 
operably linked to a nucleotide sequence encoding an 
insecticidal protein. 

wherein said promoter is capable of directing 
tissue-preferred or tissue-specific expression of said gene in 

said maize plant. 

77. A maize plant of claim 76, wherein said gene 
encodes a Bacillus thuringiensis protein. 

78. A plant of claim 76, wherein said promoter is 
obtained from a monocot. 

79. A plant of claim 78, wherein said monocot is 

maize. 

80. A maize plant of claim 76, wherein said promoter 
is capable of directing pith-preferred expression of said gene 

in said maize plant. 

81. A plant of claim 76, wherein said promoter is 
obtained from a plant tryptophan synthase- alpha subunit gene. 

82. A plant of claim 76, wherein said promoter is 
obtained from a maize tryptophan synthase-alpha subunit gene. 
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33. A plant of claim 32, wherein said promoter 
comprises the sequence set forth in Figure 24 . 

84. A plant of claim 76, wherein said promoter is 
capable of directing green tissue-specific expression of said 
gene in said maize plant. 

85. A plant of claim 84, wherein said promoter is a 
PEP carboxylase promoter . 

86. A maize plant of claim 76, wherein said promoter 
is capable of directing pollen-specific expression of said gene 
in said maize plant. 

37. a maize plant of claim 76, wherein said promoter 
is obtained from a plant calcium-dependent phosphate kinase 
gene. 

88. A maize plant of claim 76, wherein said promoter 
is obtained from a maize calcium-dependent phosphate kinase 
gene. 

89. A maxze plant of claim 76, wherein said oromotsr 
comprises the sequence set forth in Figure 35. 

90. A method of producing a maize optimized coding 
sequence for an insecticidal 3.t. protein, comprising: 

determining the amino acid sequence of a predetermined 
insecticidal B.t. protein, and 

altering the coding sequence of the protein by 
substituting codons which are most preferred in maize for 
corresponding native codons. 

91. A method of protecting a maize plant against at 
least one insect pest, comprising: 
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stably transforming a maize plant with at least on 
nucle tide sequence of claim 9, wherein the coding sequence 
encodes an insecticidal protein; whereby the transformed maize 
plant expresses the insecticidal protein in an amount sufficient 
to protect the plant against the pest. 

92. The method of claim 91, wherein said insecticidal 
protein is a 3.t . protein. 

93. The method of claim 91, wherein said promoter is a 
tissue-specific or a tissue-preferred promoter. 

94 . A plant which has been stably transformed with a 
nucleotide sequence wherein said nucleotide sequence comprises a 
maize optimized coding sequence for an insecticidal protein, 
wherein said protein is expressed in said transformed plant at 
least 100 fold greater than expression of the protein using a 
native coding sequence. 
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FIG. I 

j. j 20 30 4s. 50 60 

* * * * * * 

BTHKURHD ATGGATAACAATCCGAACATCAATGAATGCATTCCTTATAATTGTTTAAGTAACCCTGAA 

f Isynbt. fin C C.C C..G...-. .C..C..C..C..CC.G..C C..G 

bssyn C. - . . .C. .C C .G C.C..C..C..CC.G..C C .G. 

70 80 90 100 110 120 

* * * * * * 

-BTHKURHD GTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCAATCGATATTTCCTTG 

flsvnbr.f in . .G. .G. .GC.G. -C .C .GC.C. .C .G. .C . .C C C. .CAG.C. . 

bssyn . . G . . G . .GC.G. . C . C .GC.C. . C . G . . C . C C C . CAG . C . . 

130 140 150 160 170 130 

* * * * * # 

BTHKURHD TCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGACTA 

f Isynbt. f in AGC .G . .C .G . .C. .GC C. .G. .C. .G C..C..C..C...C.G..C..G 

bssyn AGC. .C.C.G..C.GC...C.G..C.G C..C..C..C...C.G..C..G 

190 200 210 220 230 240 

* * * * » * 

BTHKURHD GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 

f Isynbt. fin . .G. .C. .C. .C C.C. C.C, .AGC. .G C.C..G..G..G..C 

bssyn . .G. .C .C .C C. .C. .C .C. . .AGC. .G C . . C . G . . G . . G . . C 

250 260 270 280 290 300 

* * * * * * 

BTHKURHD GAACAGTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCAITTCTAGATTA 

f Isynbt . fin . .G. . -CC .C GC.C. .C. .G. .G CC.C G CAGCCCC.G 

bssyn ..G...C.G..C GC.C. • C - . G . . G CC.C G CXGCC . CC . G . 

310 320 330 340 350 360 

* * * * * * 

3THKURHD GAAGGACTAAGCAATCTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGAT 

f Isynbt . fin ..G..C..G C..G..C C C. , GAGC . .CC.C G . . C . . C 

cssyn . . G . . C . 3 C..G..C C C. . GAGC . . CC . C G . . C . C 

370 380 390 400 410 420 

* ' * * * * * 

BTHKURHD CCTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTGCC 

f Isynbt . fin . .C. .C .C .C. .CCGC.C .G C..C..G C C. . . 

bssyn . . C . C . C . C. . CC. GC C . G C..C..G C C... 

430 440 450 460 470 480 

* * * * * * 

BTHKURHD CTTACAACCGCTATTCCTCTTTTTGCAGTTCAAA^ 

f Isynbt. fin ..G..C C.C.C.G..C.C.G..G..C.C.G..C.C .GC.GAGC. .G 

bssyn ..C.C C.C.C.G..C.C.C.G..C.C.G..G..C.GC GAGC . . G 

490 500 510 520 530 540 

****** 

3THKURHD TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGG 

f Isynbt . fin C. G. .G.. C.C. CC.C .CC.GAGC. .GC..C.C..C. . CAGC C. .C. .G 

bssyn . . C . C . C . C . C . CC G . . CC GAGC . .GC.CC.C. CAGC C..C..G 

550 560 570 580 590 600 

* * * * * * 

BTHKURHD AGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGCTTACT 

f Isynbt. fin CC C..C..C C.C C. C.C.. C.C. CC.C. CC.C. C.C 

bssyn CC C . C. . C C.C C .C.C .C.C.CC.G..CCC . G . . c 
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I2-L. 1220 1230 124c 1250 1260 

* * ♦ * * * 

BTHKURHD TACAGAAAAAGCGGAACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTG 

f Isynbt.f in . . .CC. .G C..C..G. .CAGC C.C.C.C.T C 

bssy n . , . C C. . G C ♦ C. . G . . CAGC .....C.C.C.C.T C 

■i 

1270 1280 1290 1300 1310 1320 

* * * * * * 

3THKURHD CCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGTTTCAATGTTTCGTTCAGGCTTT 

'flsynbt.fin CA..C.C.C.C.C.TCG C. . GAGC C . . CAGT C 

bssyn - CA..C.C..C.C.C.TCG C . GAGC C . . CAGT C 

1330 1340 1350 1360 1370 1380 

****** 

3THKURHD AGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATG7TCTCTTGGATACATCGTAGTGCT 

flsynbt.fin ..C..C..C..C..G..C..C..CC.T..A AGC T..C..C C 

bssyn ..C.C.C.C.C.C.C.CCT..A AGC C C . C C 

1390 1400 1410 1420 1430 

* * * * * 

3THKURHD GAATTTAATAATATAATTCCTTCATCA--CAAATTACACAAATACCTTTAACAAAATCTA 
flsynbt.fin ..C.C.C.C.C.C.— ..G..GC.C.C.C.G..C.CCC.C .GAGC. 
bssyn ..C.C.C.C.C.C. — . . C . GC . G . . C. , C . C . C . CC G . . C . GAGC . 

1440 1450 1460 14?: 1480 149C 

* * * * - * * 

3THXURHD C7AATCTTGGCTCTGGAACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 
flsynbt.fin .C. .C .G. . .AGC. .C . CAGC . .G..G..G..C..C..C..C..C,.C..C..C..C, 
bssyn ,C .C. .G. . .AGC. .C .CAGC .G . .G . .G . . C . . C . .C . .C . .C . .C . .C . .C . .C . 

1500 1510 1520 1530 1540 1550 

* * * * . * * 

3THKURHD TTCGAAGAACTTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTAT 

f Isynbt. fm . G . . CC . C . .CAGC. .C CAGC - . .C .GCC . .G . .C. .C. .C. .C. -CC.GA 

bssyn .3. .CC.C. .CAGC. . C CAGC. . . C . GC . C. . G . . C . . C . . Z . . Z . . CC . GA 

1560 1570 1580 1590 1600 1610 

* * * » * * 

3THKURHD CACAAAGATATCGGGTAAGAATTCGCTACGCTTCTACCACAAATTTACAATTCCATACAT 

f lsynbc .fin GC . . GC . C . . C . . C .' . CC . C . . C CAGC C . CC C . G C . CA 

bssyn GC . . GC . C . . C . . C . . CC . C . . C CAGC C - . CC . G . . G C . . CA 

1620 1630 1640 1650 1660 1670 

* * * * * * 

3THKURHD CAATTGACGGAAGACCTATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGTGGGAG7A 

flsynbt.fin GC. .C CC.C..C. .C -C . . .C. .C. .CAGC. .C ,C C..C..C..C. 

ibssyn GC.C CC.C..C..C..C C..C. .CAGC. .C..C C..C..C..C. 

1680 1690 1700 1710 1720 1730 

****** 
^THKURHD ATTTACAGTCCGGAAGCTTTAGGACTGTAGGTTTTACTACTCCGTTTAACTTTTCAAATC 

flsynbt.fin .CC.G. . .AG. . .C CC. C . .C . .G . . C . . C . . C . .C . . C . - C CAGC. .C. 

bssyn .CC.G...AG...C CC.C..C..G..C..C..C..C..C..C CAGC. .C. 

1740 1750 1760 1770 1780 1790 ■ 

* * * * * * 

3THKURHD GATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAG 

flsynbt.fin .CAGC. .C.G..C.CCG..C.C..C.G CAGC C.G..C.C.C 

bssyn .CAGC. .C.G..C.CCG..C.C.C.G CAGC C..G..G..C..C. 



2/94 



' " WO 93/07278 PCT/US92/08476 

FIG. I tCONT) 

1300 310 1820 1830 iw 40 1350 

****** 

3THKURHD ATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAAGAG 

f Isynbt.f m .C..C..C..G..C..G..C..C..G..G C C. .G. .C. .CC.G . .G G 

bssyr. . C. . C . . C . . G . , C . .G. . C . . C. . G . .G C C. . g! . c! CC." G " *G " *G " 

* 1360 1870 1380 1890 1900 1910 

* * * * ★ * 

_ 3THXURHD CACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATG 

<f isynbt.f in .T..G.. ...C C C. ,CAGCAG. . .C. .G CC.G. G C C 

oss Y n -T. .G C C C. .CAGCAG. ..C..G CC .G . . G /.c\\ C. 

1920 1930 1940 1950 1960 1970 

* * * * * * 

3THKURHD TGACGGATTATCATATTGATCAAGTATCCAATTTAGTTGAGTGTTTATCTGATGAATTT^ 

f Isynbt.f in C. .C. .C. .C. .C G. .GAG. . .CC.G. .G CC.GAGC. .C. .G 

bssyn ....C..C..C..C..C... 1.1 

1980 1990 2000 2010 2020 2030 

* * * * « * 

3THKURHD GTCTGGATGAAAAAAAAGAATTGTCCGAGAAAGTCAAACATGCGAAGCGACTTAGTGATG 

flsynbt.fm .C C.G. .G. .G. .GC. .AG G. .G. .G. .C..C C G r r 

bssyn . 1-11„1 

2040 2050 2060 2070 2080 2090 

* * * * * * 

BTHKURHD AGCGGAATTTACTTCAAGATCCAAACTTTAGAGGGATCAATAGACAACTAGACCGTGGCT 

dsynbt .f in C. .CC.G. .G. .G. .C. .C CC.C. .C CC.C. .G G c 

bssyn 11-11111-1111 

^-00 2110 2120 2130 2140 2150 

* * * * * * 

3THKURHD GGAGAGGAAGTACGGATATTACCATCC.\AGGAGGCGATGACGTATTCAAAGAGAATTACG 

^lsynbt . fin . .C.C. .C. .C. .C. .C. .C G..C C G G. ^ 

bssyn 11-1111111111 

2160 2170 2180 2190 2200 2210 

* * * * * ** 

3THKURHD TTACGCTATTGGGTACCTTTGATGAGTGCTATCCAACGTATTTATATCAAAAAATAGATG 

flsynbt.fm .G. .C, .GC. . . .C C..C CC.C. CC.G C G G C C 

bssyn 1-11-11_1„11_11_1 

2220 2230 2240 2250 2260 2270 

* * * * * » 

BTHKURHD AGTCGAAATTAAAAGCCTATACCCGTTACCAATTAAGAGGGTATATCGAAGATAGTCAAG 

fxsynbt.fin . . AGC . . GC. G . . G C C GC.GC.C. .C..C G. .C. .C. .C 

1 bssyn 

2280 2290 2300 2310 2320 2330 

****** 

3THKURHD ACTTAGAAATCTATTTAATTCGCTACAATGCCAAACACGAAACAGTAAATGTGCCAGGTA 

f lsynbt. f in ..C.G..G CC.G..C C G G. .C. .G. .C CC 

bssyn . 1-11-11111.11.1 

2340 2350 2360 2370 2380 2390 • 

* * * * • » 

3THKURHD CGGGTTCCTTATGGCCGCTTTCAGCCCCAAGTCCAATCGGAAAATGTGCCCATCATTCCC 

flsynbt.fin .C. .CAG.C.G C..GAGC C..C..C C.G. C C CAG 

bssyn 11_11111_11 11 
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iCCNT) 



2400 ^ .10 2420 2430 2*»*0 2450 

* * * * * * 

BTHKURHD ATCATTTCTCCTTGGACATTGATGTTGGATGTACAGACTTAAATGAGGACTTAGGTGTAT 

f lsynbt. fin .C. .C . .AG.C C .C .G. . C .C.C. .C.G..C CG. .C .C 

bssyn 

* 2460 24.70 2480 2490 2500 2510 

* * * * * * 

BTHKURHD GGGTGATATTCAAGATTAAGACGCAAGATGGCCATGCAAGACTAGGAAATCTAGAATTTC 

•<f lsynbt. f in c c C..G..C c.ccc.g..c.c.g..g..c 

bssyn 

2520 2530 2540 2550 2560 2570 

****** 

BTHKURHD TCGAAGAGAAACCATTAGTAGGAGAAGCACTAGCTCGTGTGAAAAGAGCGGAGAAAAAAT 

f lsynbt. f in .G. .G G..CC.G..G..C..G..C..G..C..C GC.CC G..G. 

bssyn 

2580 2590 2600 2610 2620 2630 

* * * * * * 

BTHKURHD GGAGAGACAAACGTGAAAAATTGGAATGGGAAACAAATATTGTTTATAAAGAGGCAAAAG 

f lsynbt .fin ..C.C G..C..G..GC....G G..C..C..C..G..C..G C.C 

bssyn . 

2640 2650 2660 2670 2680 269C 

* * * * # * 

BTHKURHD AATCTGTAGATGCTTTATTTGTAAACTCTCAATATGATAGATTACAAGCGGATACCAACA 

f lsynbt. f in . GAGC . .G..C.CC.C.C..C. . AGC . . G . . C . CC CC G . . G . . C . C 

bssyn 

2700 2710 2720 2730 2740 2750 

* ****« 

3THKURHD TCGCGATGATTCATGCGGCAGATAAACGCGTTCA : AGCATTCGAGAAGCTTATCTGCCTG 
f lsynbt. fin . . . .C C.C.C.C.C.G C.C C. .G. .C. -C C. 

bssyn 

2760 2770 2780 2790 2800 2810 

* * * * * ■ * 

3THKURHD AGCTGTCTGTGATTCCGGGTGTCAATGCGGCTATTTTTGAAGAATTAGAAGGGCGTATTT 

f lsynbt .fin AGC C.C. .C. .G. . C . C . C • . C . C . G . . GC . G . . G . . C .C.C. 

bssyn 

2320 2830 2840 2850 2860 2870 

* * * * * * 

BTHKURHD TCACTGCATTCTCCCTATATGATGCGAGAAATGTCATTAAAAATGGTGATTTTAATAATG 
f lsynbt. fin ....C.C..AG...G..C.C.CCC.C.C.C.G..C.C.C.C.C.C 
Ibssyn • 

2830 2890 2900 2910 2920 2930 

** ****** 

BTHKURHD GCTTATCCTGCTGGAACGTGAAAGGGCATGTAGATGTAGAAGAACAAAACAACCACCGTT 

f lsynbt - fin . . C GAG G..C.C.G..C.C.G..G..G CA 

bssyn 

2940 2950 2960 2970 2980 2990 - 

* * * * * * 

BTHKURHD CGGTCCTTGTTGTTCCGGAATGGGAAGCAGAAGTGTCACAAGAA 

f lsynbt. fin GC . .G . .G . .G . .G . .C. .G G..C.C. .AGC. .G..G..G..C.G..C..C 

bssyn — 
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3000 jolO 3020 3030 3Un0 3050 

* * * * # # 

3THKURHD GTCGTGGCTATATCCTTCGTGTCACAGCGTACAAGGAGGGATATGGAGAAGGTTGCGTAA 

f Isynbt. fin .C. .C C G. .C. .G. -C. . C C..C..C..G..c!....G. 

bssyn 11111_1 

3060 3070 3080 3090 . 3100 3HQ 

* * * * # 

«THKURHD CCATTCATGAGATCGAGAACAATACAGACGAACTGAAGTTTAGCAACTGTGTAGAAGAGG 

**f Isynbt. f in ....C..C C..C G..C C C..G..G... 

bssyn 11-11_1111 



3120 3130 3140 3150 3160 3170 

* * * * * * 

BTHKURHD AAGTATATCCAAAC\ACACGGTAACGTGTAATGATTATACTGCGACTCAAGAAGAATATG 

f Isynbt. fin .G..G..C..C C..G..C..C..C..C..C..C..C..C..G..G..G..C 

bssyn 11-11.1 

3180 3190 3200 3210 3220 3230 

* * * * * * 

3THKURHD AGGGTACGTACACTTCTCGTAATCGAGGATATGACGGAGCCTATGAAAGCAAT^CTTCTG 

flsynbt.f in C. .C CAGC. .C..C..C..C..C C C..G CAGCAGC 

bssyn • 

3240 3250 3260 3270 3280 3290 

* * * * * # 

3THKURHD TACCAGCTGATTATGCATCAGCCTATGAAGAAAAAGCATATACAGATGGACGAAGAGACA 

f Isynbt. fin . G . . C. . C . . C . . C . .CAGC C..G..G..G..C..C..C..C..C..CC.C... 

bssyn 1-1111 



3300 3310 3320 3330 3340 3350 

* * * * * * 

3THKURHD ATCCTTGTGAATCTAACAGAGGATATGGGGATTACACACCACTACCAGCTGGCTATGTGA 

flsynbt.f in .C. .C. .C. .GAGC. . .C.C . .C. .C. .C. .C C . .C. .G . .C. .Z c. . . 

cssyn 1_1111 



3360 3370 3380 3390 3400 3410 

****** 
3THKURHD CAAAAGAATTAGAGTACTTC C CAGAAAC CG AT AAGGTATGGATTG A GATCGG AG AAACGG 

flsynbt.f in .C. .G. .GC.G C..G C G C C..G..C 

bssyn 111111^11.11.1 



3420 3430 3440 3450 3460 

* * . * * * 

BTHKURHD AAGGAACATTCATCGTGGACAGCGTGGAATTACTTCTTATGGAGGAATAA 

, f Isynbt. fin .G..C..C ..GC.G..G..G G..G 

"bssyn . . . TG . . G 
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BTHKURHD 
bssyn 



3THKURHD 



BTHKURHD 
bssyn 



3THKURHD 
bssyn 



BTHKURHD 
bssvn 



iw " .20 30 4u 50 60 
****** 
ATGGATAACAATCCGAACATCAATGAATGCATTCCTTATAATTGTTTAAGTAACCCTGAA 
C CC CG C..C..C..C..CC.G..C CG 

70 80 90 100 110 120 

****** 

GTAGAAG7ATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCAATCGATATTTCCTTG 
..CCGC.CCC . GC C. . C C C C C C. . CAG . C . . 

130 140 150 160 170 180 

* ***** 

TCGCTAACGCAATTTCTTTTGAGTGA^^ 

AGCCCCCGC C. .G. .C. .G CCCC.CG..CG 

190 200 210 220 230 240 

****** 

GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 
..G..CCC C.CCC . AGC . . G C..C..G..G..G..C 

250 260 270 280 290 300 

* * * * * * 

GAACAGTTAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTA 
..G...CCC G.-G CCC G CAGCC.CC.G 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



BTHKURHD 
'bssyn 



BTHKURHD 
bssyn 



310 320 330 340 350 360 

* * * * * * 

JAAGGACTAAGCAATCTTTATCAAATTTACGCAGAATCTTTTAGAGAGTGGGAAGCAGAT 
. . C • C G C G . . C C C GAGC . . CC C CCC 



370 380 390 400 410 420 

* * * * * * 

CCTACTAATCCAGCATTAAGAGAAGAGATGCGTATTCAATTCAATGACATGAACAGTCCC 
. . C . . C . . C . . C .CC.GC.C. . G C C G C C . 

430 440 450 460 470 480 

****** 
CTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTA 
t . C C. . C . C .G. . C . C . G . . G . . C . C • G . . G . . C . GC . GAGC . . G 

490 500 510 520 530 540 

* * * * * * 
TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 
..CCCCCCCG. .CCGAGC .GC. .C.C. .C. . CAGC CCG 

550 560 570 580 590 600 

****** 
AGGTGGGGATTTGATGCCGCGACTATCAATAGTCGTTATAATGATTTAACTAGGCTTATT 
C ,C CC.C C.C C . C . . C .C . C .CCG. . CC C . G . . C 

610 620 630 640 650 660 

****** 

GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 
C. .C - C .C. .C .G CCCCG C..G T 



670 



680 690 700 710 720 

* * * * * * 

CCGGATTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTA 

. ,C .CAGCC.C. .C CCCC.CCCCCCGCG..CG..CG 
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6x0 620 630 64u 650 660 

* * * * * * 

BTHKURHD GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 

f Isynbt.f in C .C. .C. .C. .C . .G C. .C. .CC.G C.G T 

bssyn C. .C. .C. .C. .C. .G C .C .CC.G C.G T 

670 680 690 700 710 720 

* * * ' * * * 
-.BTHKURHD CCGGATTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTA 
''f isynbt. fin . .C .CAGCC.C .C C C .C.C.G..CC CC.C, GCG..C.G.. C.G 

bssyn . . C . CXGCC . C . C C.G..C.C.G..CCCCC..GC.G..C.G..C.G 

730 740 750 760 770 780 

* * * * * ♦ 

3THKURHD TTAGATATCGTTTCTCTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTT 

f Isynbt.f in C.G. .C GAGC. .C .C. .C C.C.CCC.C.C.C.C.C.C.G 

bssyn C.G. . C GAGC. . G. .C. .C C . . C . .CC.C . .C . . C . . C . . C . .C . ,C . .Q 

790 300 310 820 330 840 

» * * * * * 

3THKURHD TCCCAATTAACAAGAGAAATTTATACAAACCCAGTATTAGAAAATTTTGATGGTAGTTTT 

f Isynbt. fin AG . . .GC.G. .CC.C. .G C..C C . .GC .G . .G . . C . . C . .C . .C . .C .C 

bssyn AG . . . GC . G . . CC . C . . G C..C C.GCG..G.. CC.C. CC.C 

350 360 370 380 390 900 

* * * * + * 

BTHKURHD CGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 

f Isynbt. fin . ,C .AGC .C CG..C.C.CC.CC.CCC CCG 

bssyn . .C. . . AGC . .C C.G..C.C.CCCCCCC CC.G 

910 920 930 940 950 960 

* * * * *■ * 

BTHKURHD AACAGTATAACCATCTATACGGATGCTCATAGAGGAGAATAITATTGGTCAGGGCATCAA 

f Isynbt . fin C..C C..C..C..C..CC.C..C..G..C..C.. .AGC. .C. .C .G 

bssyn C . C C..C..C..C..CC.C..C..G..C..C... AGC . . C . . c . . g 

970 980 990 . 1000 1010 1020 

* * * * * * 

3THKURHD ATAATGGCTTCTCCTG7AGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATGGAACT 

f Isynbt. fin : .C CAGC. .CC.C .CAGC .CC.G C..C..C..G..C..C..C 

bssyn . . C CAGC . . C . . C . . C . . CAGC . . C . C . G CC.C. C. CC.C 

1030 1040 1050 1060 1070 1080 

* * * * # * 

3THKURHD ATGGGAAATGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGA 

. p f Isynbt. fin CCT..A..T..CG.-.C.C..G..A..G..CC A CC.C 

' bssyn CC..T..A..T..G..G..C.C.G..A..G..G..C A CC.C 

1090 1100 1110 1120 1130 1140 

* * * * * + * 

3THKURHD AC A TT ATCGTC C ACTTT AT AT A G AAG AC CTTTTAAT AT AG GG AT AAATAATCAACAACTA 

f Isynbt.f in . . CC . GAGCAG . . .CC.G. .CC.TC C..C-.C..C..C..C.C..G..G..G 

bssyn . .CC. GAGCAG. ..CC.G. .CC.TC C..C..C..C..C..C..C..G..G..G 

1150 1160 1170 1180 1190 1200 

* * # * « * 

BTHKURHD TCTGTTCTTGACGGGACAGAATTTGCTTATGGAACCTCCT 

f Isynbt. fin AGC. .G..G C..CCC.C.C..C. .AG. AGC. .CC CAG . ..C.G 

bssyn AGC . .G..G C..C..G..C..C..C..C. .AG. AGC. .CC CAG. ..C.G 
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"IG. 



ICONT) 



BTHKURHD 
bssyn 



BTHKURHD 
.bssyn 



BTHKURHD 
bssvn 



BTHKURHD 
bssyn 



BTHKURHD 
bssvn 



BTHKURHD 
bssyn 



BTHKURHD 
bssvn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



7 _ ^ 740 750 76 770 780 

" * * * * * * 

TTAGATATCGTTTCTCTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTT 
C.G. .C GAGC. .G. .C. .C C . . C . . CC . C . . C . . C . . C. . C . . C . . C. . G 



790 



800 310 820 830 340 

***** 

TCCCAATTAACAACAGAAATTTATACAAACCCA^ 

AG. . .GC.C..CC.C..G CC C .GC.G..G. . C . . C . . C . . C . . C . . C 

850 360 370 380 390 900 
****** 
CGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 
. .C .AGC. .C C .G. .C. .C. .CC.C. -C .C. .CC CCG 

910 920 930 940 950 960 
****** 
AACAGTATAACCATCTATACGGATGCTCATAGAGGAGAATATTATTGGTCAGGGCATCAA 
C..C C. . C .C. . C . .CC.C. .C. . C . • C . . C . . AGC . . C . . C . . G 

970 980 990 1000 1010 1020 

* * » * * . * 
ATAATGGCTTCTCCTGTAGGGTTTTCGGGGCCAGAATTCACTTTTCCGCTATATGGAACT 
. .C CAGC .C .C. .C. .CAGC. .CCC...CC.CCCCC 

1030 1040 1050 1060 1070 1080 

****** 

ATGGGAAATGCAGCTCCACAACAACGTATTGTTGCTCAACTAGGTCAGGGCGTGTATAGA 
CC..T..A..T..G..G..CCG..A..G..G..C A CC.C 

1090 1100 1110 1120 1130 1140 

****** 

ACATTATCGTCCACTTTATATAGAAGACCTTTTAATATAGGGATAAATAATC.\ACAACTA 
. .CCGAGCAG. . .CCCCCTC C . . C . .C . . C . C .C . .C . . G . .G . .C 

1150 1160 1170 1180 1190 1200 

* * * * * * 
TCTGTTCTTGACGGGACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTA 
AGC ..C.G C .C.C.C.C .C.C . .AG. AGC . . CC CAG ...C.G 

1210 1220 1230 1240 1250 1260 

****** 

TACAGAAAAAGCGGAACGGTAGATTCGCTCGATGAAATACCGCCACAGAATAACAACGTG 
. . .CCG CCG. .CAGC C.CCCT C 

1270 1280 ' 1290 1300 1310 1320 
****** 
CCACCTAGGCAAGGATTTAGTCATCGAT^AAGCCATGTTTCAATGTTTCGTTCAGGCTTC 
... ...CA..C. C.C. C.C. TCG C. GAGC C CAGT C 



1330 



1340 



1350 



1360 



1370 



1380 
* 

AGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGGATACATCGTAGTGCT 
. .C .C .C .C .G. .C .C .CCT . .A AGC T. .CC C 

1390 1400 1410 1420 1430 

* * » * ♦ 

GAATTTAATAATATAATTCCTTCATCA — CAAATTACACAAATACCTTTAACAAAATCTA 
. .G. ,C .C . C . C .C . — .,CGCCC.C..G..CCC.G;.C GAGC . 
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FIG . 2 (CONT) 



BTHKURHD 
bssyn 



3THKURHD 
£ssyn 



BTHKURHD 
bssyn 



BTHKURHD 
bssyn 



1440 1450 1460 1470 1**0 1490 

****** 

CTAATCTTGGCTCTGGAACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 
. C . . C. . G . . . AGC. .C. .CAGC. .G..G..G..C..C..C..C..C..C..C..C..C. 

1500 1510 1520 1530 1540 1550 

* * * * * * 

TTCGAAGAACTTCACCTGGCCAGATTTCAACCTTAAGAGTAAATATTACTGCACCATTAT 
,G. .CC.C. .CAGC. .C CAGC. . - C . GC. C. .G . .C. . C. . C . . C . . CC .GA 

1560 1570 1580 1590 1600 1610 

****** 

CACAAAGATATCGGGTAAGAATTCGCTACGCTTCTACCACAAATTTACAATTCCATACAT 
GC . . GC . C . . C . . C . . CC . C . . C CAGC C .. CC . G G ..... C .'. CA 

1620 1630 1640 1650 1660 1670 

* * * * * * 

CAATTGACGGAAGACCTATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGTGGGAGTA 
GC..C CC;C. .C. .C. ,C C. .C. .CAGC. .C. .C C. ,C. .C. .C. 



BTHKURHD 
bssyn 



.680 1690 1700 1710 1720 1730 

* * * * * * 

ATTTACAGTCCGGAAGCTTTAGGACTGTAGGTTTTACTACTCCGTTTAACTTTTCAAATG 
.CC.G. . .AG. . .C CC,C..C..G..C..C..C..C..C..C CAGC. .C. 



BTHKURHD 
bssyn 



1740 1750 1760 1770 1780 1790 

* * * * * * 

GATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAG 
. CAGC ..C..G..C..CC.G..C..C..C..G CAGC C..G..G..C..C. 



3THKURK2 
bssvn 



.800 1810 1320 1830 1340 1850 

* * * * * * 

ATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATATGATTTAGAAAGAG 
.C..C..C..G..C..G..C..C..G..G C C . . G . . C . . CC . G . .G , .G . 



1360 1370 1880 1890 1900 1910 

* * * * * * 

BTHKURHD CACAAAAGGCGGTGAATGAGCTGTTTACTTCTTCCAATCAAATCGGGTTAAAAACAGATG 
bssyn .T..G C C C. .CAGCAG. . , C . . G CC.G. . G . . C . . C . 



BTHKURHD 
bssyn 



1920 1930 1940 
* * * 
TGACGGATTATCATATTGATCAAGTATCC 
C. .C. .C. .C. .C G. . G. AG 
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FIG . 3 

iw 20 30 4o 50 60 

* * ★ * » * 

synlT . mze ATGGACAACAACCCCAACATCAACGAGTGCATCCCCTACAACTGCCTGAGCAACCCCGAG 

bssyn 

synful.mod 

» 

70 80 90 100 110 120 

* * * * * * 

synlT . mze GTGGAGGTGCTGGGCGGCGAGCGCATCGAGACCCGCTACACCCCCATCGACATCACCCTG 

'bssyn 

synful .mod 

130 140 150 160 170 180 

* * * * * * 

synlT • m2e AGCCTGACCCAGTTCCTGCTGAGCGAGTTCGTGCCCGGCGCCGGCTTCGTGCTGGGCCTG 

bssyn 

synful.mod 

190 200 210 220 230 240 

* * * ★ * * 

SynlT . mze GTGGACATCATCTGGGGCATCTTCGGCCCCAGCCAGTGGGACGCCTTCCTGGTGCAGATC 

bssyn 

synful.mod 

250 260 2"C 280 Z90 30C 

* * * * * « 

synlT . mze GAGCAGCTGATCAACCAGCGCA7CGAGGAGTTCGCCCGCAACCAGGCCATCAGCCGCCTG 

bssyn 

synful.mod 

310 320 330 340 350 360 

* * * » * * 

syr.iT . mze GAGGGCCTGAGCAACCTGTACCAGATCTACGCCGAGAGC77CCGCGAGTGGGAGGCCGAC 

bssyn a 

= /r.rul . mod A 

370 380 ' 390 400 410 42C 

« * * * * » 

synlT . raze CCCACCAACCCCGCCCTGCGCGAGGAGATGCGCATCCAGTTCAACGACATGAACAGCGCC 

bssyn 

synful.mod 

430 440 450 460 470 480 

* * ★ * * * 

SynlT . mze CTGACCACCGCCATCCCCCTGTTCGCCGTGCAGAACTACCAGGTGCCCCTGCTGAGCGTG 

bssyn 

1 synful.mod 

490 500 510 520 530 540 

* * * ★ * * 

synlT . mze TACGTGCAGGCCGCCAACCTGCACCTGAGCGTGCTGCGCGACGTGAGCGTGTTCGGCCAG 

bssyn C 

synful.mod C 

550 560 570 580 590 600 

* * * * * * 

synlT . raze CGCTGGGGCTTCGACGCCGCCACCATCAACACCCGCTACAACGACCTGACCCGCCTGATC 

bssyn 

synf ul .mod . 

10/94 



WO 93/07278 PCT/US92/08476 

FIG. I (CCN7) 

b.O 620 630 6<^ 650 660 

* * * » * * 

synlT.m2e GGCAACTACACCGACCACGCCGTGCGCTGGTACAACACCGGCCTGGAGCGCGTGTGGGGC 
bssyn " T 

synf ul . zod ! 

670 680 690 700 710 720 

* * * * * ^ 

synlT . zze CCCGACAGCCGCGAC7GGA7CCGC7ACAACCAG7TCCGCCGCGAGC7GACCC7GACCG7G 

bssyn A.G 

synful.-aod A.G 

730 740 750 760 770 780 

* * * * * * 

synlT . zze CTGGACATCGTGAGCCTGTTCCCCAACTACGACAGCCGCACCTACCCCATCCGCACCGTG 

bssyn a 

synf ui . zod 

790 300 810 820 830 840 

* » * * * * 

synlT . zze AGCCAGC7GACCCGCGAGA7C7ACACCAACCCCG7GC7GGAGAAC77CGACGGCAGC77C 
bssyn 7 

synful.rrod 7 

350 360 370 330 390 90C 

* * * . * * * 

synlT . zze CGCGGCAGCGCCCAGGGCATCGAGGGCAGCATCCGCAGCCCCCACCTGATGGACATCCTG 
bssyn - 

synf ul . nod ......./....[ 

910 920 930 940 950 96C 

****** 

syniT . AACAGCATCACCATCTACACCGACGCZCACCGCGGCCAG7AC7ACTGGAGC3GCCACCAG 
bssyn 

970 980 990 1000 1010 102C 

****«♦ 

5ynl7 . zze A7CA7GGCCAGCCCCG7GGGC77CAGCGGCCCCGAG77CACC77CCCCC7G7ACGGCACC 

bssyn C 

synf ul .niod C .././../....] 

1030 1040 1050 1060 1070 1080 

****** 
synlT.aze A7GGGCAACGCCGCCCCCCAGCAGCGCA7CG7GGCCCAGC7GGGCCAGGGCG7G7ACCGC 

bssyn T. .A. .7 A .A 

synf ui. nod 7. .A. .7 A . A 

1090 1100 1110 1120 1130 1140 

* * * * * * 

synl7 . zze ACCC7GAGCAGCACCC7G7ACCGCCGCCCCTITCAACA7CGGCA7CAAC\ACCAGCAGCTG 

bssyn . 7 . . A . . 7 

synf ul . nod 7. .A. .7 

1150 1160 1170 1180 1190 1200 

****** 

synl7 . mze AGCG7GC7GGACGGCACCGAG7TCGCC7ACGGCACCAGCAGCAACC7GCCCAGCGCCG7G 
bssyn 

synful .mod * \ * ' 
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FIG. 3 (CONT) 

li.J 1220 1230 12^^ 1250 1260 

* * * * * * 

synlT . raze TACCGCAAGAGCGGCACCGTGGACAGCCTGGACGAGATCCCCCCCCAGAACAACAACGTG 

bssyn T 

synful.mod T 

1270 1280 1290 1300 1310 1320 

* * T # * t 

synlT.mze CCCCCCCGGCAGGGCTTCAGCCACCGCCTGAGCCACGTGAGCATGTTCCGCAGCGGCTTC 

bssyn . .A. . T. . A T T 

synf ul . mod • . A. .T. • A T T. . . . . . 

1330 1340 1350 1360 1370 1330 

* * * * * * 

synlT . mze AGCAACAGCAGCGTGAGCATCATCCGCGCCCCCATGTTCAGCTGGATCCACCGCAGCGCC 

bssyn T..A..T T T . . . 

synful .mod T..A..T T T. . . 

1390 1400 1410 1420 1430 1440 

synlT . nze GAGTTCAACAACATCATCCCCAGCAGCCAGATCACCCAGATCCCCCTGACCAAGAGCACZ 

bssyn 

synful.mod 

1450 1460 147Q 1480 1490 150C 

* * * * * * 

synlT . mze AACCTGGGCAGCGGCACCAGCGTGGTGAAGGGCCCCGGCTTCACCGGCGGCGACATCCTG 

bssyn 

synful.mod 

1510 1520 1530 1540 1550 1560 

* ★ * * * * 

syr. 17 . ~ze CGCCGCACCAGCCCCGGCCAGA7CAGCACCCTGC3CGTGAACATCACCGCCCCCCTGAGC 

bssyn 

syr.ru 1 . r.cd 

1570 1580 1590 1600 1610 162C 

* * * * * * 

synlT.mze CAGCGCTACCGCGTGCGCATCCGCTACGCCAGCACCACCAACCTGCAGTTCCACACCAGC 

bssyn C 

synful.mod C 

1630 1640 1650 1660 1670 1680 

* * # ★ » * 

synlT . mze ATCGACGGCCGCCCCATCAACCAGGGCAACTTCAGCGCCACCATGAGCAGCGGCAGCAAC 

bssyn 

synful.mod 

1690 1700 1710 1720 1730 1740 

* * # * * * 

synlT.mze CTGCAGAGCGGCAGCTTCCGCACCGTGGGCTTCACCACCCCCTTCAACTTCAGCAACGGC 

bssyn 

synful.mod 

1750 1760 1770 1780 1790 1300 

* * * * * * 

synlT . mze AGCAGCGTGTTCACCCTGAGCGCCCACGTGTTCAACAGCGGCAACGAGGTGTACATCGAC 

bssyn 

synful.mod 
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FIG . 3 (CONT) 
1820 1830 184w 1350 1360 

★ * * * * * 

synlT . mze CGCATCGAGTTCGTGCCCGCCGAGGTGACCTTCGAGGCCGAGTACGACCTGGAGCGCGCC 

bssyn A.G..T 

synf ul . mod A. G . . T 

: 1870 1880 1890 1900 1910 1920 

* * « " * » * 

^svnlT . mze CAGAAGGCCGTGAACGAGCTGTTCACCAGCAGC.kACCAGATCGGCCTGAAGACCGACGTG 

c bssyn 

synf ul .mod 

1930 1940 1950 1960 1970 1980 

* * * * * * 

synlT . mze ACCGACTACCACATCGACCAGGTGAGCAACCTGGTGGAGTGCCTGAGCGACGAGTTCTGC 

bssyn T 

synf ul .mod T 

1990 2000 2010 2020 2030 2040 

★ * ★ * * * 

synlT . mze CTGGACGAGAAGAAGGAGCTGAGCGAGAAGGTGAAGCACGCCAAGCGCCTGAGCGACGAG 

bssyn 

synf ul. mod 

2050 206C 20^0 2080 2090 210C 

★ * * * * « 

synlT . mze CGC.\ACCTGCTGCAGGACCCC.\ACTTCCGCGGCATCAACCGCCAGCTGGACCGCGGCTGG 

bssyn - 

synf ul. mod 

2110 2120 2130 2140 2150 2160 

★ * * * * * 

synlT . mze CGCGGCAGCACCGACATC ACC A TCC AGGGCGGCGACGACGTGTTCAAGG AG AA7TACGTG 

bssyn 

synf ul . mod 

■ 2170 1180 2190 2200 2210 2220 

» * » * » * 

synlT . mze ACCCTGCTGGGCACCTTCGACGAGTGCTACCCCACCTACCTGTACCAGAAGA7CGACGAG 

bssyn 

synf ul. mod 

2230 2240 2250 2260 2270 2280 

* * * * * * 

synlT . mze AGCAAGCTGAAGGCCTACACCCGCTACCAGCTGCGCGGCTACATCGAGGACAGCCAGGAC 

bssyn 

r synf ul. mod 

2290 2300 2310 2320 2330 2340 

# * * * * * 

? synlT . mze CTGGAGATCTACCTGATCCGCTACAACGCCAAGCACGAGACCGTGAACGTGCCCGGCACC 

bssyn — 

synf ul . mod 

2350 2360 2370 2380 2390 2400 

* « * * * * 

synlT . mze GGCAGCCTGTGGCCCCTGAGCGCCCCCAGCCCCATCGGCAAGTGCGCCCACCACAGCCAC 

bssyn 

synf ul . mod 
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FIG. 3 (CONT) 

24±u 2420 2430 244^ 2450 2460 

****** 

synlT . mze CACTTCAGCCTGGACATCGACGTGGGCTGCACCGACCTGAACGAGGACCTGGGCGTGTGG 

bssyn • 

synful.mod 

2470 2480 2490 2500 2510 2520 

* * * * * * 

synlT . m2e GTGATCTTCAAGATCAAGACCCAGGACGGCCACGCCCGCC7GGGCAACCTGGAG7TCCTG 

bssyn 

synful.mod 

2530 2540 2550 2560 2570 2580 

* * » * * * 

synlT . mze GAGGAGAAGCCCCTGG7GGGCGAGGCCCTGGCCCGCGTGAAGCGCGCCGAGAAGAAG7GG 

bssyn 

synf ul . mod 

2590 2600 2610 2620 2630 2640 

* ***** 

synlT . mze CGCGACAAGCGCGAGAAGCTGGAG7GGGAGACCAACATCGTGTACAAGGAGGCCAAGGAG 

zssyn . 

synf ul . mod 

Z55C 266? 267Q Z680 2690 27QC 

* ***** 

synlT . mze AGCG7GGACGCCC7G77CG7GAACAGCCAG7ACGACCGCCTGCAGGCCGACACCAACA7C 

bssyn 

synful.mod 

2710 2720 2730 2740 2750 2760 

* ***** 

s ynl7.nze GCCATGATCCACGCCGCCGACAJ^3CGCG7GCACAGCArCCGCGAGGCC7ACC7GCCCGAG 
bssyn : 

synfu-l.nod 7 

2770 2780 2790 2300 2810 282C 

* ***** 

synlT . mze C7GAGCG7GA7CCCCGGCG7GAACGCCGCCA7C77CGAGGAGC7GGAGGGCCGCA7C77C 

bssyn 

synful.mod 

2830 2840 2850 2860 2870 288C 

* * * » * * 

synlT. mze ACCGCC7TCAGCC7GTACGACGCCCGCAACG7GA7CAAGAACGGCGAC7TC.\ACAACGGC 
bssyn ( ., 

synful.mod 

2890 2900 2910 2920 2930 2940 

* * * * * * 

synl7 . mze C7GAGC7GC7GGAACG7GAAGGGCCACG7GGACG7GGAGGAGCAGAACAACCACCGCAGC 

bssyn 

synful.mod 

2950 2960 2970 2980 2990 3000 

* ***** 

synl7 . mze GTGCTGGTGGTGCCCGAGTGGGAGGCCGAGGTGAGCCAGGAGG7GCGCG7G7GCCCCGGC 

bssyn — 

synf ul . mod 

14/94 



• * WO 93/07278 PCT/ US 92/08476 

FIG. 3 (CONT) 

30. ^ 3020 3030 304 3050 3060 

.*«*** 

synlT . nze CGCGGCTACATCCTGCGCGTGACCGCCTACAAGGAGGGCTACGGCGAGGGCTGCGTGACC 

QSS y n " 

synf ul . mod 

3070 3080 3090 3100 3110 3120 

* ★ * * * * 
synlT . nze ATCCACGAGATCGAGAACAACACCGACGAGCTGAAGTTCAGCAACTGCGTGGAGGAGGAG 

bssyn • 

synf ui-% mod C 

3130 3140 3150 3160 3170 3180 

* * * * * * 

synlT . nze GTGTACCCCAACAACACCGTGACCTGCAACGACTACACCGCCACCCAGGAGGAGTACGAG 

bssyn 

synf ul . nod 

3190 3200 3210 3220 3230 3240 

****** 

synlT . r.2e GGCACCTACACCAGCCGCAACCGCGGCTACGACGGCGCCTACGAGAGCAACAGCAGCGTG 

bssyr. 

synf ul . mod < 

3 25C 3260 22^2 3280 329C 330C 

* * * * * * 

synlT . nze CCCGCCCACTACGCCAGCGCCTACGAGGAGAAGGCCTACACCGACGGCCGCCGCGACAAC 

bssyn 

synf ul. nod 

3310 3320 3330 3340 3350 3360 

****** 

synlT .r.ze CCCTGCGAGAGCAACCGCGGCTACGGCGACTACACCCCCC7GCCCGCCGGC7ACG7GACC 

bssyn =- 

£ : n: ul . .TiOC 

3370 3380 3390 3400 3410 342C 

****** 

synlT . nze AAGGAGCTGGAGTACTTCCCCGAGACCGACAAGGTGTGGATCGAGATCGGCGAGACCGAG 

bssyn 

synf ui . mod 

3430 3440 3450 3460 

* * * * 

synlT . raze GGCACCTTCATCGTGGACAGCGTGGAGCTGCTGCTGATGGAGGAGTAG 

bssyn : 

svnful.mod 
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FIG. 4 



x 0 20 30 -0 50 60 

****** 
BTHKURHD ATGGATAACAATCCGAACATCAATGAATGCATTCCTTATAATTGTTTAAGTAACCCTGAA 

PMONBT C CA C A. . C . .C . . C . . G A... 

bssyn C C..C CG C. .C. .C. .C. .CC.G. .C C. .G 

70 80 90 100 110 120 

* * * * * * 
3THKURHD GTAGAAGTATTAGGTGGAGAAAGAATAGAAACTGGTTACACCCCAATCGATA7TTCCTTG 

PMONBT . .T C.T CCT C T..C CC 

bssyn . .CCGCCCC.GCCC .CCC C C. .CACC . 

130 140 150 160 170 180 

* * * * * * 
BTHKURHD TCGCTAACGCAATTTCTTTTGAGTGAATTTGTTCCCGGTGCTGGATTTGTGTTAGGACTA 

PMONBT . . CT . G . . A . . G GC.CC.G..C.G..A G . . C . . TC . C 

bssyn AGC .G . .C .G. .C .GC CG. .CG C . .C . -C . . C . . . CG . .C . .G 

190 200 210 220 230 240 

****** 

3THKURHD GTTGATATAATATGGGGAATTTTTGGTCCCTCTCAATGGGACGCATTTCTTGTACAAATT 

PMONBT CCC T - . C A T C C G 

bssyn . . , G . . C . C . C C..C..C..C... AGC . . G CCG..G..CC 

250 260 270 230 290 3GC 

* * * * * * 

BTHKURHD GAACAG7TAATTAACCAAAGAATAGAAGAATTCGCTAGGAACCAAGCCATTTCTAGATTA 

PMONBT ..G G..C G..G..C G C G C C G 

bssyn . ,G. . .C.G. .C GC.C. .C. .G. . G CC.C G CAGCC.CC.G 

310 320 330 340 350 360 

* * * . * * * 

3THKURH2 GAAGGACTAAGCAAT "7TA7CAAA777ACGCAGAATC77T7AGAGAG7GGGAAGCAGA7 

PMONBT T.G C..C C..7 GAGC . . C C . . . 

bssy- ..G..C..G C G . . C C 7.. GAGC . . CC C Z . . C . . : 

370 380 390 400 410 427 

* * * * ■ * * 
BTHKURHD CCTACTAATCCAGCATTAAGAGAAGAGATGCG7ATTCAATTCAATGACATGAACAGTGCC 

PMONBT C TC . CC . C . . G . . A C C . 

bssyn . . C . .C . .C . .C . .CC .GC .C . .G C..C..G C C . 

430 440 450 460 470 480 

* * * * * * 
BTHKURHD CTTACAACCGCTATTCCTCTTTTTGCAGTTCAAAATTATCAAGTTCCTCTTTTATCAGTA 

PMONBT T . G . . C . . A C. . AT. G..C C. . C . C. . C C . . G . . C . G 

bssyn . .CC CCCG. .C.C.G.. CCCCC. .C .GC.G AGC. .G 

490 500 510 520 530 540 

* * * * * * 
BTHKURHD TATGTTCAAGCTGCAAATTTACATTTATCAGTTTTGAGAGATGTTTCAGTGTTTGGACAA 

PMONBT ..C A..T...C.T..CC. CAGC . . GC . TC . . . . C . . . AGC G . . . 

bssyn ..CCCCC CC.G. .CCGAGC. .GC. .CC. .C. .CAGC CC.G 

550 560 570 580 590 600 

* * * * * * 
BTHKURHD AGGTGGGGATTTGATGCCGCGACTATCAATAGTCGT^ATAATGATTTAACTAGGCTTATT 

PMONBT C T..A..C C C . C . CC . T C . 

bssyn CC CCC CC CCCCC CC.G. .CCCCC 
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610 620 630 S4u 650 660 

* ***** 

3THKUHHD GGCAACTATACAGATCATGCTGTACGCTGGTACAATACGGGATTAGAGCGTGTATGGGGA 

PMONBT ..A C. .C. -C. .C T..T C-.T..C..G C T 

bssvn C. . C. . C. .C. . C. . G C. .C. .CC.G C..G T 



3THKLTRHD 
*?MONBT 
bssyn 



3THKURHD 

PMONBT 

bssyn 



670. 680 690 700 710 720 

* * * ■* * ★ 

CCGGATTCTAGAGATTGGATAAGATATAATCAATTTAGAAGAGAATTAACACTAACTGTA 

..7 T C. .C. .G. .C. .G G..C..C..A..T 

. . C . . CAGCC . C . . C C . . G . . C . .C...G. . CC . CC . C . . GC . G . . C . . G . . C . . G 

730 740 750 760 770 780 

* * * * * * 

TTAGATATCGTTTCTCTATTTCCGAACTATGATAGTAGAACGTATCCAATTCGAACAGTT 

..G..C..T..G...'..C..C CTCC .....C..C..T. ...C . . T G 

C . G . . C GAGC . .G..C..C C..C..CC.Z..C..C..C..C..C..C..G 



3THKURHD 

PMONBT 

cssyr. 



790 
* 



300 



810 



320 



830 



340 



TCCCAATTAACAAGAGAAATTTATACAAACC2AGTATTAGAAAATTTTGATGGTAGTTTT 

C.T..C C T TC.Z..G..C..C..C C..C 

AG • . . GC . G . . CC . C . . G C , . C C . . GC . Z . . G . . C • . C . . C . . C. . C . . C 



350 360 S^C :50 590 90C 

* * * * * * 

3THKURHD CGAGGCTCGGCTCAGGGCATAGAAGGAAGTATTAGGAGTCCACATTTGATGGATATACTT 

PMONBT ..T..T..T..C..A..T..C CTCC. .C C C C..CT.G 

bssyn . .C. . .AGC. .C C..G..C..C..CC.C..C..C..CC C..C.G 



PMONBT 



910 920 930 940 950 960 

* * * * * * 

AACAGTATAACCATCTATACGGATGCTCATAGAGGAGAArAiTATTGGTCAGGGCATCAA 
C T C.GC C Z C T . . A . . C . . G 

r..c c c c cc .c c c . . . agc. . c. . z. .z 



970 980 990 LOGO 1010 102C 

* * * * * * 

27HKURHD ATAATGGCTTCTCCTGTAGGG7TTTCGGGGCCAGAATTCACTTTTCCGC7ATATGGAACT 

PMONBT . . C C A . . T . . A . . CAGC C..G..T..C T..C 

bssyn . . C CAGC . . C . . C . . C . . CAGC . . C . . C . - G C . . C . .C. . G . . C . ♦ C . . C 

1030 1040 1050 1060 1070 1080 

* * * * * * 

3THKURHD A7GGGAAATGCACC7CCACAACAACG7A7TC77GC7CAAC7AGG7CAGGGCG7G7A7AGA 

PMONBT .C. .C . . .C T. .C. .C. . . 

: bssyn C..C..7..A..T..G..G..C..C..G..A..G..G..C A CC.C 

1090 1100 1110 1120 1130 1140 

* * * * * * 

' 37HKURHD ACATTATCGTCCACTTTATATAGAAGACCTTTTAATATAGGGATAAATAATCAACAACTA 

PMONBT ..C..G..T C..G..C C. .C C..T..C..C..C..G T 

bssvn . . CC . GAGCAG . . . CC - G . .CC. TC C. . C . . C . . C . . C . .C. . C • .G..G..G 



1150 1160 1170 1130 1190 1200 

* * * * * * 

BTHKURHD TCTGTTCTTGACGGGACAGAATTTGCTTATGGAACCTCCTCAAATTTGCCATCCGCTGTA 

PMONBT . . C A G..C..C T..T..C T 

bssyn AGC . . G . . G C . .C..G..C..C..C..C. ..AG. AGC . . CC . . . . CAG . . . C . . G 
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BTHKURHD 

PMONBT 

bssvn 



FIG. 4 1CONT) 



12*J 1220 1230 124w 1250 1260 
****** 
TACAGAAAAAGCGGAACGGTAGATTCGCTGGATGAAATACCGCCACAGAATAACAACGTG 
\ . . G C . • T CT . . . . C C . . A C T . . . 



,C.C. .G C..C..G. .CAGC, 



3THKURHD 
•PMONBT 
bssvn - 



BTHKURHD 

PMONBT 

bssvn 



1270 1280 1290 1200 1310 1320 

* * * * * * 

CCACCTAGGCAAGGATTTAGTCATCGATTAAGCCATGTTTCAATGTTTCGTTCAGGCTTT 

C CTCC . . CA . G . . G C . . G . . C C C . - A . . C 

C.A..G. . C . . C . .C..C. .TC.G C . . GAGC C . . CAGT C 

1330 1340 1250 1360 1370 1380 

****** 

AGTAATAGTAGTGTAAGTATAATAAGAGCTCCTATGTTCTCTTGGATACATCGTAGTGCT 

. .C. .C. . .TCC. .G..C..C..C A T 

..C..C..C..C..G..C..C..CC.T..A AGC T..C..C C 



BTHKURHD 

PMONBT 

bssvn 



1390 



1400 



.410 
* 



1420 



1420 



GAATTTAATAATATAATTCCTTCATCA--CAAATTACACAAATACCTTT.\ACAAAATCTA 

..G..C..C C C..7-- C..C.....C..A..G..C..G,... 

..G..C..C..C..C..C..-- •..G..GC..G..C..C..G..C..CC.G..C. GAGC . 



BTHKURHD 

PMCNBT 

bssvn 



144? 1430 146: 147C 148C 1490 

* * » * * * 

CTAATCTTGGCTCTGGAACTTCTGTCGTTAAAGGACCAGGATTTACAGGAGGAGATATTC 

C A G C. .C 7 

. C . . C . . G . . . AGC . . C . . CAGC ..G..G..G..C..C..C..C..C..C..C..C'..C. 



1500 1510 1520 1530 1540 1550 

* * * * * * 

bthkurhd ttcg^gaacttcacctgcccagatttc^ccttaagagtaaatattactgcaccattat 
phonbt . . a t agc . . . c . c 7..c..c c . t . 

:=3vr, . G . . ZZ . Z . . GAGC . . Z GAGC . . . Z . GC .C..G..C..C..G..I.. GC . GA 



BTHKURHD 

PMONBT 

bssyn 



BTHKURHD 

PMONBT 

bssyn 



x BTHKURHD 
PMONBT 
bssyn 



BTHKURHD 

PMONBT 

bssyn 



1560 1570 1580 1590 1600 161 : 

* * * * * * 
CACAAAGATATCGGGTAAGAATTCGCTACGCTTCTACCAC,\AATTTAC.\ATTGCATACAT 
. T T..C..G T A T..C..G C. -C. 

' GC. .GC.C. .C. .C. .CC.C. .C CAGC C. .CC.G. .G C. .CA 

1620 1630 1640 1650 1660 1670 

****** 
CAATTGACGGAAGACCTATTAATCAGGGGAATTTTTCAGCAACTATGAGTAGTGGGAGTA 

. C . . C G C T..C..C..C C . . . TCA . C . . C . . C . 

GC.C CC.C..C..C..C C..C. CAGC . . C . . C C . . C . . C . . C . 

1680 1690 1700 1710 1720 1720 

* * * * * * 
ATTTACAGTCCGGAAGCTTTAGGACTGTAGGTTTTACTACTCCGTTTAAC 

.C..G..A C C..A..C..C C T..C. 

. CC . G . . . AG . ..C CC.C. . C . .G..C..C..C..C. .C. . C . . 



TCAAATG 
C . . T . • C . 
CAGC. .C. 



1740 



1760 



1770 



1780 



790 



1750 

* * * * » * 

GATCAAGTGTATTTACGTTAAGTGCTCATGTCTTCAATTCAGGCAATGAAGTTTATATAG 

C..T.-C..CC.T..C G T G..C..T. 

. CAGC . .C..G..C.CC.G..C..C..C..G CAGC C..G..G..C..C. 
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1800 310 1820 1830 . ^40 

* * # * r 

3THKURHD ATCGAATTGAATTTGTTCCGGCAGAAGTAACCTTTGAGGCAGAATA 

PMONBT . C . . T G G . . T . . C T C T • . G . . 

bss V n : C. .C. .C. .G. .C. . G . .C. .C. . G . .G C C. .g! ! CGACCTGGAGAGGG 



5THXURHD 

PMONBT 

bssyn CTCAGAAGGCCGTGAACGAGCTGTTCACCAGCAGCAACCAGATCGGCCTGAAGACCGACG 



57HKURHD t 

PMONBT C 

-ssyn TGACCGACTACCACATCGATCAGGTGTAG 
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PMONBT 
bssyn 



PMONBT 
bssyn 



FIG . 5 

10 20 30 .0 50 60 

****** 

ATGGACAACAACCCAAACATCAACGAATGCATTCCATACAACTGCTTGAGTAACCCAGAA 
C G C .C C . . .C C. .G 

70 30 90 100 110 120 

****** 

GTTGAAGTACTTGGTGGAGAACGCATTGAAACCGGTTACACTCCCATCGACATCTCCTTG 
..C.C. C.C. C.C.G C..G C C AG.C. 



PMONBT 
bssvn 



130 140 150 160 170 180 

****** 

TCCTTGACACAGTTTCTGCTCAGCGAGTTCGTGCCAGGTGCTGGGTTCGTTCTCGGACTA 
AG . C . . . C C G C . . C . . C . . C C . G . . C. . G 



PMONBT 
bssyn 



240 
* 



190 200 210 220 230 

***** 

GTTGACATCATCTGGGGTATCTTTGGTCCATCTCAATGGGATGCATTCCTGGTGCAAATT 
. . G C C . . C . . CAGC . . G C.C C.C 



PMONBT 
bssvn 



250 260 270 280 290 300 

****** 

GAGCAGTTGATCAACCAGAGGATCGAAGAGTTCGCCAGGAACCAGGCCATCTCTAGGTTG 
C C.C G . . . C C AGCC . CC . . 



PMONBT 
bssyn 



310 320 330 340 350 360 

****** 

GAAGGATTGAGCAATCTCTACCAAATCTATGCAGAGAGCTTCAGAGAGTGGGAAGCCGAT 
. . G. . CC C..G C.C CC G . . . C 



=mcn=: 

bssvn 



370 380 390 400 410 420 

****** 

:CTAC7>^CCCAGCTZTCCCCGAG3A^.7GCGTATTC.\ATTC^CGACArC^:AGCGCC 
.C.C C .C .G G C .C. . G 



PMONBT 
bssvn 



430 



440 



450 



4 60 
* 



480 



TTGACCACAGCTATCCCATTGTTCGCAGTCCAGAACTACCA.\GTTCCTCTCTTGTCCGT: 
C C.C CC C - G G . . C . C -. GC. . AG . . • 



PMONBT 
bssyn 



490 500 510 520 530 540 

* * * * •» •* 

:acgttcaagcagctaatcttcacctcagcgtgcttcgagacgttagcgtgtttgggcaa 
g. .g. .c. .c. .c. .g g g.-c c c.c.g 



PMONBT 
bssyn 



550 560 570 580 590 600 

****** 

AGGTGGGGATTCGATGCTGCAACCATCAATAGCCGTTACAACGACCTTACTAGGCTGATT 
CC C C.C.C C C G. .CC.C C 



PMONBT 
bssyn 



610 



620 



630 



640 



650 
* 



660 



GGAAACTACACCGACCACGCTGTTCGTTGGTACAACACTGGCTTGGAGCGTGTCTGGGGT 
. .C C.C.C C . .C C . G 



PMONBT 
bssyn 



670 630 690 700 710 720 

* * * * * * 

CCTGATTCTAGAGATTGGATTAGATACAACCAGTTCAGGAGAGAATTGACCCTCACAGTT 
. . C . CAGCC .C.C C . G C.CC.C..GC. C.C.G 
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PMONBT. 
bssyn 



PMONBT 
bssvn 



■ 0 740 750 7^ 770 780 

* * * * * * 

TTGGACATTGTGTCTCTCTTCCCGAACTATGACTCCAGAACCTACCCTATCCGTACAGTG 
C C . . . AGC . . G C C . . . AG . C . C C C.C... 



790 300 310 820 330 340 

* * * * * » 

TCCCAACTTACCAGAGAAATCTATACTAACCCAGTTCTTGAGAACTTCGACGGTAGCTTC 
AG. .. C. C. .CC. C.T. .C.C C..G..G r 



PMONBT 
bssyn 



850 360 370 380 390 900 

* * *, * * * 

CGTGGTTCTGCCCAAGGTATCGAAGGCTCCATCAGGAGCCCACACTTGATGGACATCTTG 
. . C . CAGC G..C C . . AG . . . . C. C C...C c . . 



PMONBT 
bssyn 



pmcnbt 

bssvn 



PMONBT 
bssyn 



.-mine: 
bssvn 



: MCNBT 
rssvn 



PMONBT 
bssyn 



910 920 930 940 950 960 

* * ★ * * * 

AACAGCATAACTATCTACAGCGATGCTCACAGAGGAGAGTATTACTGGTCTGGACACCAG 
C.C C...C..C...C.C..C C AGC . . C 

970 980 990 1000 1010 1020 

<* * * * * * 

A7CATGGCCTCTCCAGTTGGATTCAGCGGGCCCGAG7TTACCTTTCCTCTCTATGGAACT 
AGC . . C . C . C C C C.C. C.C. C.C 



1030 



1080 



L040 1050 1060 1070 

* * * * * 

ATGGGAAACGCCGCTCCACAACAACGTATCGTTGCTCAACTAGGTCAGGGTGTCTACAGA 
C T..A..T..C.C.C G..A..G..G..C A. .C . .CC 



1140 



1090 1100 1110 1120 1130 

****** 

ACCTTGTCTTCCACCTTGTACAGAAGACCCTTC.^ATATCGGTATCAACAACCAGCAACTT 
. . . C . AGCAG . . . . C C.TC....T C C G..G 



200 



1150 1160 1170 1180 1190 

* * * * * 

7CCGTTCTTGACGGAACAGAGTTCGCCTATGGAACCTCTTCTAACTTGCCATCCGCTGTT 
AG . . . C . G C.C C.C. .AGCAGC . .C CAG. ..C.C 

1210 1220 1230 1240 1250 1260 

* * * * * * 

TACAGAAAGAGCGGAACCGTTGATTCCTTGGACGAAATCCCACCACAGAACAACAATGTG 
• C.C C C. CAG . C G C.T C.. 



PMONBT 
bssyn 



1270 1280 . 1290 1300 1310 1320 

* * * * * * 

CCACCCAGGCAAGGATTCTCCCACAGGTTGAGCCACGTGTCCATGTTCCGTTCCGGATTC 
TCA. .G, .C. . .AC . . .CTC AG CAGT . .C . . 



PMONBT 
bssyn 



PMONBT 
bssyn 



1330 1340 1350 1360 1370 1380 

* * * * * * 

AGCAACAGTTCCGTGAGCATCATCAGAGCTCCTATGTTCTCATGGATTCATCGTAGTGCT 
CAG c.T. .A AGC C.C C 

1390 1400 1410 1420 1430 1440 

* * * * * * 

GAGTTCAAC.\ATATCATTCCTTCCTCTCAAATCACCCAAATCCCATTGACCAAGTCTACT 
C C . CAG . AGC . . G G CC AGC . . C 

21/94 



WO 93/07278 



PCT/US92/08476 



! MONBT 
issvn 



(CONT) 



U.j 1460 1470 14bu 1490 1500 

* • * * * * 

AACC77GGA7C7GGAACT7C7G7CG7GAAAGGACCAGGC77CACAGGAGG7GA7A7TCTT 
G . .CAGC. .C. .CAGC. . G G..C..C C..C..C..C..C..G 



: MONB1 
issyn 



; MONBT 
:ssvr. 



1510 1520 1520 1540 1550 1560 

★ * * * * * 

AGAAGAACTTCTCCTGGCCAGATTAGCACCCTCAGAGTTAACATCACTGCACCACTTTCT 
C . CC . C . . CAGC . . C C GC . C . . G . . C . . C . . C. . GAGC 



1570 



1580 



.590 



1600 



1610 
* 



1620 



CAAAGATATCGTGTCAGGATTCG7TACGCATCTACCACTAACTTGCAATTCCACACCTCC 
..GC.C..C..C...C.C..C..C CAGC C ... C .... G AG . 



=MONBT 
:ssvn 



1630 1640 1650 1660 1670 1680 
* « * * » 
ATCGACGGAAGGCCTATCAATCAGGGTAACTTCTCCGCAACCATG7CAAGCGGCAGCAAC 
CC . C . . C C C AG ... C AGC 



1690 



1700 



1720 



1730 



1740 



:ssvn 



77GCAATCCGGCAGCTTCAGAACCG7CGG7TTCAC7AC7CC7T7CAAC77C7C7AACGGA 
C . . . . GAG . . . C . C G..C C..C..C AGC C 



?mgne: 

cssvn 



1750 1760 17TQ 173.0 1790 1800 

* * * t * * 

7CAAGCG7TTTCACCC7TAGCGC7CA7G7G7TCAATTCTGGCAATGAAG7G7ACA7TGAC 
AGC G G C..C CAGC C..G C... 



.310 



1820 



13 jO 



.840 



ZG7ATT3AG77737GCC7GCwGAAv^77ACCTTi- -Awwv. * sjAo a A — "*"™"""" — — — — 

. . C . . C C C G..G C CG ACCTGG AG AGGGC7 



=M2sa: 



CAGAAGGCCGTGAACGAGC7G7TCACCAGCAGCAACCAGA7CGGCC7GAAGACCGACGTG 



?MCNB7 
cssyn 



ACCGAC7ACCACA7CGA7CAGG7G7AG 
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64 ATGGACCTGC TGCCCGACGC CCGCATCGAG GACAGCCTGT GCATCGCCGA GGGCAAGAAC 
MetAspLeu LeuProAsp AlaArglleGlu AspSerLeu CysIleAla GluGI y AsnAsn 

124 ATCGACCCCT TCGTGAGCGC CAGCACCGTG CAGACCGGCA 7CAACATCGC CGGCCGCATC 
IleAspPro PheValSer AlaSerThrVal GlnThrGly IleAsnlle AlaGlyArglle 

184 CTGGGCGTGC TGGGCGTGCC CTTCGCCGGC CAGCTGGCCA GCTTCTACAG CTTCCTGGTG 
LeuGlyVal LeuGlyVal ProPheAlaGly GlnLeuAla SerPheTyr SerPheLeuVal 

244 GGCGAGCTGT GGCCCCGCGG CCGCGACCAG TGGGAGATCT TCCTGGAGCA CGTGGAGCAG 
GlyGluLeu TrpProArg GlyArgAspGln TrpGluIle PheLeuGlu HisValGluGlu 

304 CTGATCAACC AGCAGATCAC CGAGAACGCC CGCAACACCG CCCTGGCCCG CC7GCAGGGC 
LeuIleAsn GlnGlnlle ThrGluAsnAla ArgAsnThr AlaLeuAla ArgLeuGlnGIy 

364 CTGGGCGACA GCTTCCGCGC CTACCAGCAG AGCCTGGAGG ACTGGCTGGA GAACCGCGAC 
LeuGlyAsp SerPheArg AlaTyrGlnGln SerLeuGlu AspTrpLeu GluAsnArgAsp 

424 GACGCCCGCA CCCGCAGCGT GCTGTACACC CAGTACATCG CCCTGGAGCT GGACTTCCTG 
AspAlaArg ThrArgSer ValLeuTyrThr GinTyrlle AlaLeuGlu LeuAspPheLeu 

484 AACGCCATGC CCCTGTTCGC CATCCGCAAC CAGGAGGTGC CCCTGCTGAT GGTGTACGCC 
AsnAlaMet ProLeuPhe AlalleArgAsn GlnGluVal ProLeuLeu MetValTyrAla 

544 CAGGCCGCCA ACCTGCACCT GCTGCTGCTG CGCGACGCCA GCCTGTTCGG CAGCGAGTTC 
GlnAlaAla AsnLeuHis LeulreuLeuLeu ArgAspAia SerLeuPhe GlySerGluPhe" 

604 GGCC7GACCA GCCAGGAGAT CCAGCGCTAC TACGAGCGCC AGGTGGAGCG CACCCGCGAC 
GlyLeuThr SerGlnGlu IleGlnArgTyr TyrGluArg GlnValGlu ArgThrArgAsp 

664 TACAGCGACT ACTGCGTGGA GTGGTACAAC ACCGGCCTGA ACAGCCTGCG CGGCACCAAC 
TyrSerAsp TyrCysVal GluTrpTyrAsn ThrGlyLeu AsnSerLeu ArgGIyThrAsn 

724 GCCGCCAGCT GGGTGCGCTA CAACCAGTTC CGCCGCGACC TGACCCTGGG CGTGCTGGAC 
AlaAlaSer TrpValArg TyrAsnGlnPhe ArgArgAsp LeuThrLeu GlyValLeuAsp 

784 CTGG7GGCCC TGTTCCCCAG CTACGACACC CGCACCTACC CCATCAACAC CAGCGCCCAG 
LeuValAla LeuPhePro SerTyrAspThr ArgThrTyr ProIleAsn ThrSerAIaGLo 

844 CTGACCCGCG AGGTGTACAC CGACGCCATC GGCGCCACCG GCG7GAACAT GGCCAGCATG 
LeuThrArg GluValTyr ThrAspAlalle GlyAlaThr GlyValAsn *SetAlaSerHet 

904 AACTGGTACA ACAACAAC5C CCCCAGCTTC AGCGCCATCG AGGCCGCCGC CATCCGCAGC 
AsnTrpTyr AsnAsnAsn AlaProSerPhe SerAlalle GluAlaAla AlalleArgSer 

964 CCCCACCTGC TGGACTTCCT GGAGCAGC7G ACCATCTTCA GCGCCAGCAG CCGCTGGAGC 
ProflisLeu LeuAspPhe LeuGluGlnLeu ThrllePhe SerAlaSer SerArgTrpSer 

1024 AACACCCGCC ACATGACCTA CTGGCGCGGC CACACCATCC AGAGCCGCCC CATCGGCGGC 
AsnThrArg HisMetThx TyrTrpArgGly HisThrlle GlnSerArg ProIleGlyGly 
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1084 GGCCTGAACA CCAGCACCCA CGGCGCCACC AACACCAGCA 7CAACCCCG7 GACCCTGCGC 
GiyLeuAsn ThrSerThr HisGlyAlaThr AsnTIirSer IleAsnPro ValThrLeuArg 

1144" TTCGCCAGCC GCGACGTGTA CCGCACCGAG AGCTACGCCG GCGTGCTGCT GTGGGGCATC 
PheAlaSer ArgAspVal TyrArgThrGlu SerTyrAla GlyValLeu LeuTrpGIylle 

1204 TACCTGGAGC CCATCCACGG CG7GCCCACC GTGCGCTTCA AC7TCACCAA CCCCCAGAAC 
TyrLeuGlu ProIleHis GlyValProThr ValArgPhe AsnPheThr AsnProGlnAsn 

1264 ATCAGCGACC GCGGCACCGC CAAC7ACAGC CAGCCCTACG AGAGCCCCGG CCTGCAGCTG 
IleSerAsp ArgGlyThr AlaAsnTyrSer GlnProTyr GluSerPro GlyLeuGlnLeu 

1324 AAGGACAGCG AGACCGAGC7 GCCCCCCGAG ACCACCGAGC GCCCCAAC7A CGAGAGCTAC 
LysAspSer GluThrGlu LeuProProGlu ThrThrGlu ArgProAsn TyrGluSerTyr 

1384 AGCCACCGCC TGAGCCACAT CGGCATCATC CTGCAGAGCC GCGTGAACGT GCCCGTGTAC 
SerHisArg LeuSerHis IleGlyllelle LeuGlnSer ArgValAsn ValProValTyr 

4 44 AGCTGGACCC ACCGCAGCGC CGACCGCACC AACACCATCG GCCCCAACCG CATCACCCAG 
SerTrpThr HisArgSer AlaAspArgThr AsnThrlle GlyProAsn ArglleThxGln 

1504 ATCCCCATGG TGAAGGCCAG CGAGC7GCCC CAGGGCACCA CCGTGGTGCG CGGCCCCGGC 
IleProHet: ValLysAla SerGluLeuPro GlnGlyTkr ThrValVal ArgGlyProGly 

1564 TTCACCGGCG GCGACATCC7 GCGCCGCACC AACACCGGCG GCTTCGGCCC CATCCGCGTG 
PheThrGly GlyAspIle LeuArgArgThr AsnThrGly GlyPheGly ProIleArgVai 

1624 ACCGTGAACG GCCCCC7GAC CCAGCGC7AC CGCATCGGCT TCCGC7ACGC CAGCACCG7G 
ThrValAsn GlyProLeu ThrGlnArg7yr ArglleGly PheArgTyr AlaSerThrVal 

1684 GAC7TCGAC7 7C77CG7GAG CCGCGGCGGC ACCACCG7GA ACAAC77CCG C77CC7GCGC 
AspPheAsp PhePheVal SerAxgGlyGly 7hr7hrVal AsnAsnPhe ArgPheLeuArg 

1744 ACCATGAACA GCGGCGACGA GC7GAAG7AC GGCAAC77CG 7GCGCCGCGC C77CACCACC 
7hrMetAsn SerGlyAsp GluLeuLys7yr GlyAsnPhe ValArgArg AlaPheThrThr 

1804 CCC77CACC7 TCACCCAGA7 CCAGGACA7C A7CCGCACCA GCA7CCAGGG CC7GAGCGGC 
ProPheThr Phe7hrGln IleGlnAspIle IleArgThr SerlleGIn GlyLeuSerGly 

1864 AACGGCGAGG 7G7ACA7CGA CAAGATCGAG A7CA7CCCCG 7GACCGCCAC C77CGAGGCC 
AsnGlyGlu Val7yrlle AspLysIleGlu IlellePro Val7tirA2a 7hrPheGluAla 

1924 GAG7ACGACC 7GGAGCGCGC CCAGGAGGCC G7GAACGCCC 7G77CACCAA CACCAACCCC 
Glu7yrAsp LeuGluArg AlaGlnGluAIa ValAsnAla LeuPhe7hr AsnThrAsnPro 

1984 CGCCGCC7GA AGACCGACG7 GACCGAC7AC CACATCGACC AGG7GAGCAA CC7GG7GGCC 
ArgArgLeu LysThrAsp Val7hrAspTyr HisIleAsp GlnValSer AsnLeuValAla 

204 4 TGCC7GAGCG ACGAG77C7G CC7GGACGAG AAGCGCGAGC 7GC7GGAGAA GG7GAAG7AC 
CysLeuSer AspGluPhe CysLeuAspGlu LysArgGlii LeuLeuGlu LysValLysTyr 
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2104 

2164 

2224 

2284 

2344 

2404 

2464 

2524 

2584 

2644 

2704 

2764 

2824 

2884 

2944 

3004 



GCCAAGCGCC 7GAGCGACGA GCGCAACC7G CTGCAGGACC CCAACTTCAC CAGCATCAAC 
AlaLysArg LeuSerAsp GluArgAsnLeu LeuGlnAsp ProAsnPhe ThrSerlleAsn 

AAGCAGCCCG ACTTCATCAG CACCAACGAG CAGAGCAACT TCACCAGCAT CCACGAGCAG 
LysGlnPro AspPhelle SerThrAsnGiu GinSerAsn PheThrSer IleflisGluGLn 

AGCGAGCACG GCTGGTGGGG GAGCGAGAAC ATCACCATCC AGGAGGGCAA CGACGTGTTC 
SerGluHxs GlyTrpTrp GlySerGluAsn IleThrlle GlnGluGly AsnAspValPhe 

AAGGAGAACT ACGTGACCCT GCCCGGCACC TTCAACGAGT GCTACCCCAC CTACCTGTAC 
LysGluAsn TyrValThr LeuProGlyThr PheAsnGlu CysTyrPro ThrTyrLeuTyr 

CAGAAGATCG GCGAGAGCGA GCTGAAGGCC TACACCCGCT ACCAGCTGCG CGGCTACATC 
GlnLys lie GlyGluSer GluLeuLysAla TyrThrArg TyrGlnLeu ArgGlyTyrlle 

GAGGACAGCC AGGACC7GGA GA7C7ACC7G ATCCSC7ACA ACGCCAAGCA CGAGACCC7G 
GluAspSer GlnAspLeu GiuIleTyrLeu IleArgTyr AsnAlaLys HisGluThrLeii 

GACGTGCCCG GCACCGAGAG CCTGTGGCCC CTGAGCGTGG AGAGCCCCAT CGGCCGCTGC ' 
AspValPro GlyThrGlu SerLeuTrpPro LeuSerVal GluSerPro IleGlyAxgCys ' 

GGCGAGCCCA ACCGC7GCGC CCCCCACTTC GAGTGGAACC CCGACCTGGA CTGCAGCTGC 
GlyGluPro AsnArgCys AlaProHisPhe GluTrpAsn ProAspLeu AspCysSerCys 

CGCGACGGCG AGAAG7GCGC CCACCACAGC CACCACTTCA GCC7GGACAT CGACGTGGGC 
AxgAspGly GluLysCys AlafiisHisSer HisHisPtie SerLeuAsp IleAspValGly 

TGCACCGACC TGCACGAGAA CC7GGGCGTG TGGGTGGTG7 TCAAGATCAA GACCCAGGAG 
CysThrAsp LeuHisGlu AsnLeuGlyVal TrpValVal PheLysIle LysThrGlnGlu 

GGCCACGCCC GCCTGGGCAA CCTGGAGTTC ATCGAGGAGA AGCCCC7GCT GGGCGAGGCC 
GxyHisAla ArgLeuGly AsnLeuGluPhe IleGluGlu LysProLeu LeuGlyGluAla 

C7GAGCCGCG 7GAAGCGCGC CGAGAAGAAG 7GGCGCGACA AGCGCGAGAA GCTGCAGCTG 
LeuSerAxg ValLysArg AlaGluLysLys 7rpAxgAsp LysArgGlu LysLeuGlnLeu 

GAGACCAAGC GCG7G7ACAC CGAGGCCAAG GAGGCCG7GG ACGCCC7G77 CGTGGACAGC 
GluThxLys ArgVal7yr 7hxGluAiaLys GluAlaVai AspAlaLeu PheVaUVspSer 

CAGTACGAC Z GCC7GCAGGC CGACACCAAC A7CGGCA7GA TCCACGCCGC CGACAAGCTG 
Gxn7yrAsp ArgLeuGln AlaAspThrAsn IleGlyBet IleflisAla AlaAspLysLeu 

G7GCACCGCA TCCGCGAGGC C7ACC7GAGC GAGCTGCCCG 7GA7CCCCGG CG7GAACGCC 
valHisArg IleArgGlu AlaTyrLeuSer GluLeuPro ValllePro GlyValAsnAla 

^ tG ^? C T? CG AGGAGC7GGA GGGCCACATC ATCACCGCCA TCAGCC7GTA CGACGCCCGC 
GxuIlePhe GluGluLeu GluGlyHisIle IleThrAla IleSerLeu 7yrAspAlaArg 
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3064 AACGTGGTGA AGAACGGCGA CTTCAACAAC GGCCTGACCT GC7GGAACG7 GAAGGGCCAC 
AsnVaJLVal LysAsnGly AspPheAsnAsn GlyLeuThr CysTrpAsn ValLysGlyHis 

3124 G7GGACG7GC AGGAGAGCCA CCACCGCAGC GACC7GG7GA 7CCCCGAG7G GGAGGCCGAG 
ValAspVal GlnGlnSer HisHisArgSer AspLeuVal IieProGlu TrpGIuAlaGlu 

3184 GTGAGCCAGG CCGTGCGCGT GTGCCCCGGC TGCGGCTACA TCCTGCGCGT GACCGCCTAC 
ValSerGln AlaValArg VaJLCysProGly CysGlyTyr IleLeuArg ValThxAlaTyr 

324 4 AAGGAGGGCT ACGGCGAGGG CTGCG7GACC ATCCACGAGA TCGAGAACAA CACCGACGAG 
LysGluGly TyrGlyGlu GlyCysVal7kr IleflisGlu IleGluAsn AsnThxAspGlu 

3304 CTGAAGTTCA AGAACCGCGA GGAGGAGGAG GTGTACCCCA CCGACACCGG CACCTGCAAC 
LeuLysPhe LysAsnArg GluGluGluGlu ValTyrPro ThrAspThr GlyThrCysAsa 

3364 GAC7ACACCG CCCACCAGGG CACCGCCGGC TGCGCCGACG CC7GCAACAG CCGCAACGCC 
AspTyrThr AlaHisGln GlyThrAlaGly CysAlaAsp AlaCysAsn SerArgAsnAla 

34 24 GGCTACGAGG ACGCCTACGA GGTGGACACC ACCGCCAGCG TGAACTACAA GCCCACCTAC 
GlyTyrGlu AspAlaTyr GluValAspThr ThrAlaSer VaJLAsnTyr LysProThrTyr 

34 84 GAGGAGGAGA CCTACACCGA CGTGCGCCGC GACAACCACT GCGAGTACGA CCGCGGCTAC 
GluGluGlu ThrTyrThr AspValArgArg AspAsnflis CysGluTyr AspArgGlyTyr 

3544 GTGAACTACC CCCCCGTGCC CGCCGGC7AC G7GACCAAGG AGCTGGAGTA C77CCCCGAG 
ValAsnTyr ProProVal ProAIaGlyTyr ValThrLys GluLeuGlu TyrPbeProGlu 

3604 ACCGACACCG TG7GGATCGA GATCGGCGAG ACCGAGGGCA AGTTCATCGT GGACAGCG7G 
ThrAspThr ValTrpIle GluIleGlyGlu ThrGluGly LysPhelle ValAspSerVal 

3664 GAGCTGC7GC 7GA7GGAGGA G7AG 
GluLeuLeu LeuMetGlu Glu 
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FIG. 6 



64 ATGGACCTGC 7GCCCGACGC CCGCATCGAG GACAGCCTGT GCATCGCCGA GGGCAACAAC 
MetAspLeu LeuProAsp AlaArglleGlu AspSerLeu CysIleAla GluGiyAsnAsn 

124 A7CGACCCC7 7CG7GAGCGC CAGCACCG7G CAGACCGGCA 7CAACATCGC CGGCCGCATC 
IleAspPro PheValSer AlaSerThrVal GlnThrGly IleAsnlle AlaGlyArglle 

184 CTGGGCGTGC TGGGCGTGCC CTTCGCCGGC CAGCTGGCCA GCTTCTACAG CTTCCTGGTG 
LeuGlyVal LeuGlyVal ProPheAlaGly GlnLeuAla SerPheTyr SerPheLeuVal 

244 GGCGAGCTGT GGCCCCGCGG CCGCGACCAG TGGGAGATC7 TCCTGGAGCA CGTGGAGCAG 
GlyGluLeu TrpProArg GlyArgAspGin TrpGluIle PheLeuGlu HisValGluGln 

304 C7GA7CAACC AGCAGATCAC CGAGAACGCC CGCAACACCG CCCTGGCCCG CCTGCAGGGC 
LeuIleAsn GlnGlnlle ThrGluAsnAla ArgAsnThr AlaLeuAla ArgLeuGlnGly 

3 64 CTGGGCGACA GCTTCCGCGC CTACCAGCAG AGCCTGGAGG ACTGGCTGGA GAACCGCGAC 

LeuGlyAsp SerPheArg AlaTyrGlnGln SerLeuGlu AspTrpLeu GluAsnArgAsp 

424 GACGCCCGCA CCCGCAGCG7 GCTGTACACC CAGTACATCG CCCTGGAGCT GGACTTCCTG 
AspAlaArg ThrArgSer ValLeuTyrThr GlnTyrlle AlaLeuGlu LeuAspPheLeu 

4 84 AACGCCATGC CCCTGTTCGC CATCCGCAAC CAGGAGGTGC CCCTGCTGAT GGTGTACGCC 

AsnAlaMet ProLeuPhe AlalleArgAsn GlnGluVal ProLeuLeu MetValTyrAla 

544 CAGGCCGCCA ACCTGCACC7 GC7GC7GC7G CGCGACGCCA GCC7G77CGG CAGCGAG77C 
GinAlaAla AsnLeuHis LeuLeuLeuLeu Ax g Asp Ala SerLeuPhe GlySerGluPhe 

604 GGCC7GACCA GCCAGGAGA7 CCAGCGC7AC TACGAGCGCC AGG7GGAGCG CACCCGCGAC 
GlyLeuThr SerGlnGlu I leGlnAxgTy r TyrGluArg GlnValGlu ArgThxArgAsp 

664 7ACAGCGACT AC7GCG7GGA G7GGTACAAC ACCGGCC7GA ACAGCCTGCG CGGCACCAAC 
7yrSerAsp 7yrCysVal GluTrpTyrAsn 7hrGlyLeu AsnSerLeu ArgGlyTlrrAsn 

724 GCCGCCAGC7 GGG7GCGC7A CAACCAG77C CGCCGCGACC 7GACCC7GGG CG7GC7GGAC 
AlaAlaSer TrpValArg 7yrAsnGlnPhe ArgArgAsp LeuThrLeu GlyValLeuAsp 

784 C7GG7GGCCC 7GT7CCCCAG CTACGACACC CGCACC7ACC CCA7CAACAC CAGCGCCCAG 
LeuValAla LeuPhePro Ser7yrAsp7hr ArgThrTyr ProIleAsn 7hrSerAiaGln 

844 C7GACCCGCG AGG7G7ACAC CGACGCCA7C GGCGCCACCG GCG7GAACA7 GGCCAGCA7G 
LeuThxArg GluValTyr 7hrAspAlaIle GlyAIaThr GiyValAsn MetAiaSerMet 

904 AACTGGTACA ACAACAACGC CCCCAGCTTC AGCGCCA7CG AGGCCGCCGC CA7CCGCAGC 
Asn7rpTyr AsnAsnAsn AlaProSerPhe SerAlalle GluAlaAla AlalleArgSer 

964 CCCCACCTGC 7GGAC77CC7 GGAGCAGC7G ACCA7C77CA GCGCCAGCAG CCGC7GGAGC 
ProHisLeu LeuAspPhe LeuGluGlnLeu TbxIlePhe SerAlaSer SerArgTrpSer 

1024 AACACCCGCC ACA7GACC7A C7GGCGCGGC CACACCA7CC AGAGCCGCCC CA7CGGCGGC 
AsnThxArg HisMetThr TyrTrpArgGly HisTiirlle GlnSerArg ProIleGlyGly 
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FIG . 6 (CON?] 



1084 GGCCTGAACA CCAGCACCCA CGGCGCCACC 
GlyLeuAsn ThrSerThr HisGlyAlaThr 

1144 TTCGCCAGCC GCGACG7G7A CCGCACCGAG 
PheAlaSer ArgAspVal TyrArgThrGlu 

1204 TACCTGGAGC CCATCCACGG CGTGCCCACC 
TyrLeuGlu ProIleHis GlyValProThr 

1264 ATCAGCGACC GCGGCACCGC CAACTACAGC 
IleSerAsp ArgGlyThr AlaAsnTyrSer 

1324 AAGGACAGCG AGACCGAGC7 GCCCCCCGAG 
LysAspSer GluThrGlu LeuProProGlu 

1384 . AGCCACCGCC TGAGCCACAT CGGCATCATC 
SerHisArg LeuSerflis IleGlyllelle 

4 44 AGCTGGACCC ACCGCAGCGC CGACCGCACC AACACCATCG GCCCCAACCG CATCACCCAG 
SerTrpThr HisArgSer AlaAspArgThr AsnThrlle GlyProAsn ArglleThrGln 

1504 ATCCCCATGG TGAAGGCCAG CGAGCTGCCC CAGGGCACCA CCGTGGTGCG CGGCCCCGGC 
IleProMet ValLysAla SerGluLeuPro GlnGlyThr ThrValVal ArgGlyProGly 

1564 TTCACCGGCG GCGACATCCT GCGCCGCACC AACACCGGCG GCTTCGGCCC CATCCGCGTG 
PheThrGly GlyAspIle LeuArgArgThr AsnThrGly GlyPheGly ProIleArgVal 

1624 ACCGTGAACG GCCCCCTGAC CCAGCGCTAC CGCATCGGCT TCCGCTACGC CAGCACCGTG 
ThrValAsn GlyProLeu ThrGlnArgTyr ArglleGly PheArgTyr AlaSerThrVal 

1684 GACTTCGACT TCTTCGTGAG CCGCGGCGGC ACCACCGTGA ACAACTTCCG C77CCTGCGC 
AspPheAsp PhePheVal SerArgGlyGly ThrThrVal AsnAsnPhe ArgPheLeuArg 

1744 ACCATGAACA GCGGCGACGA GCTGAAGTAC GGCAACTTCG TGCGCCGCGC CT7CACCACC 
ThrMetAsn SerGlyAsp GluLeuLysTyr GlyAsnPhe ValArgAxg AlaPheThrThr 

1804 CCC77CACC7 TCACCCAGAT CCAGGACATC ATCCGCACCA GCATCCAGGG CCTGAGCGGC 
ProPheThr PheThrGln IleGlnAspIle IleArgThr SerlleGln GlyLeuSerGly 

1864 AACGGCGAGG TGTACATCGA CAAGATCGAG ATCATCCCCG TGACCGCCAC CTTCGAGGCC 
AsnGlyGlu ValTyrlle AspLysIleGlu HellePro ValThrAla ThrPheGluAla 

1924 GAGTACGACC TGGAGCGCGC CCAGGAGGCC GTGAACGCCC TGTTCACCAA CACCAACCCC 
GluTyrAsp LeuGluArg AlaGlnGluAla ValAsnAla LeuPheThr AsnThrAsnPro 

1984 CGCCGCC7GA AGACCGACGT GACCGACTAC CACA7CGACC AGG7GAGCAA CCTGGTGGCC 
ArgArgLeu LysThrAsp ValTfcirAspTyr HisIleAsp GlnValSer AsnLeuValAla 

2044 TGCCTGAGCG ACGAG77C7G GCTGGACGAG AAGCGCGAGC TGCTGGAGAA GGTGAAGTAC 
CysLeuSer AspGluPhe CysLeuAspGlu LysArgGlu LeuLeuGlu LysValLysTyr 



AACACCAGCA TCAACCCCGT GACCCTGCGC 
AsnTiirSer IleAsnPro ValThrLeuArg 

AGC7ACGCCG GCG7GC7GC7 GTGGGGCATC 
SerTyrAla GlyValLeu LeuTrpGIylle 

GTGCGC7TCA ACTTCACCAA CCCCCAGAAC 
ValArgPbe AsnPheThx AsnProGlnAsn 

CAGCCCTACG AGAGCCCCGG CCTGCAGCTG 
GlnProTyr GluSerPro GlyLeuGlnLeu 

ACCACCGAGC GCCCCAACTA CGAGAGCTAC 
ThrThrGIu ArgProAsn TyrGluSerTyr 

CTGCAGAGCC GCG7GAACGT GCCCG7GTAC 
LeuGlnSer ArgValAsn ValProVal7yr 
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"IG. 5 (CON?) 



2104 GCCAAGCGCC 7GAGCGACGA GCGCAACCTG CTGCAGGACC CCAACTTCAC CAGCATCAAr 
AlaLysArg LeuSerAsp GluAxgAsnLeu LeuGlnAsp ProAsnPhe ThrSerlleAsn 

2164 AAGCAGCCCG ACTTCATCAG CACCAACGAG CAGAGCAAC7 TCACCAGCAT CCACGAGCAG 
LysGlnPro AspPhelle SerTiirAsnGlu GlnSerAsn PheThrSer IleflisGluGln 

222 4 AGCGAGCACG GCTGGTGGGG CAGCGAGAAC ATCACCATCC AGGAGGGCAA CGACGTGTTC 
SerGluflis GlyTrpTrp GlySerGluAsn IleThrlle GlnGluGly AsnAspValPhe 

2284 AAGGAGAACT ACG7GACCCT GCCCGGCACC TTCAACGAGT GCTACCCCAC CTACCTGTAC 
LysGluAsn TyrValThr LeuProGlyThr PheAsnGlu CysTyrPro 7hr7yr^uTyr 

2344 CAGAAGATCG GCGAGAGCGA GCTGAAGGCC TACACCCGC? ACCAGCTGCG CGGCTACATC 
GlnLysIle GlyGluSer GluLeuLysAla TyrThrAxg TyrGlnLeu ArgGly^yrlir 

2404 GAGGACAGCC AGGACCTGGA GATCTACCTG ATCCGCTACA ACGCCAAGCA CGAGACCCTG 
GluAspSer GlnAspLeu GluIleTyrLeu IleArgTyr AsnAlaLys HisGluThrSu 

2464 ^ T ^ CC = f^JCGAGAG CCTGTGGCCC C7GAGCGTGG AGAGCCCCA7 CGGCCGCTGC 
AspValPro GlyThrGlu SerLeuTrpPro LeuSerVal GluSerPro UeGiyArgCys 

2524 GGCGAGCCCA ACCGCTGCGC CCCCCACTTC GAGTGGAACC CCGACCTGGA CTGCAGCTGC 
GxyGluPro AsnArgCys AlaProHisPhe GluTrpAsn ProAspLeu AspCysSerCys 

2584 CGCGACGGCG AGAAGTGCGC CCACCACAGC CACCACTTCA GCCTGGACAT CGACGTGGGC 
AxgAspGxy GluLysCys AlaHisHisSer HisHisPhe SerLeuAsp IleAspValGly 

264 4 TGCACCGACC TGCACGAGAA CC7GGGCG7G 7GGG7GG7G7 7CAAGA7CAA GACCCAGGAG 
CysThrAsp LeuflisGlu Asnl^uGlyVal TrpValVal PheLysIle LysThrGlnGlu 

2704 GGCCACGCCC GCC7GGGCAA CC7GGAG7TC A7CGAGGAGA AGCCCC7GC7 GGGCGAGGCC 
^ ArgLeuGly AsnLeuGluPhe IleGluGlu LysProLeu LeuGlyGluAla 

2764 C7GAGCCGCG 7GAAGCGCGC CGAGAAGAAG TGGCGCGACA AGCGCSAGAA GC-GCAGCTG 
LeuSerArg ValLysArg AlaGluLysLys 7rpArgAsp LysArgGlu LysLeuGlnLeu 

2824 GAGACCAAGC GCGTGTACAC CGAGGCCAAG GAGGCCG TGG ACGCCCTGTT CG7GGACAGC 
GluThrLys AxgVal7yr ThrGluAlaLys GluAlaVal AspAlaLeu PheValAspSer 

2884 CAG7ACGACC GCCTGCAGGC CGACACCAAC ATCGGCATGA TCCACGCCGC CGACAAGCTG 
GlnTyrAsp AxgLeuGln AlaAspThrAsn IleGlyMet IleflisAla AlaAspLysLeu 

2944 GTGCACCGCA TCCGCGAGGC CTACCTGAGC GAGCTGCCCG TGATCCCCGG CGTGAACGCC 
valHisArg IleArgGlu AlaTyrLeuSer GluLeuPro ValllePro GlyValAsnATa 

3004 GAGA7C7TCG AGGAGC7GGA GGGCCACATC ATCACCGCCA TCAGCC7G7A CGACGCCCGC 
GxuIIePhe GluGluLeu GluGlyHisIle IleThxAla IleSerLeu TyrAspAlaArg 
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c 

3064 AACGTGGTGA AGAACGGCGA CTTCAACAAC GGCCTGACCT GC7GGAACG7 GAAGGGCCAC 
AsnValVal LysAsnGly AspPheAsnAsn GlyLeuThr CysTrpAsn ValLysGlyHis 

3124 GTGGACGTGC AGCAGAGCCA CCACCGCAGC GACC7GG7GA TCCCCGAGTG GGAGGCCGAG 
ValAspVal GlnGlnSer HisHisArgSer AspLeuVal IleProGlu TrpGluAlaGlu 

3184 GTGAGCCAGG CCGTGCGCGT GTGCCCCGGC TGCGGCTACA TCCTGCGCGT GACCGCCTAC 
ValSerGln AlaValArg ValCysProGly CysGlyTyr IleLeuArg ValThrAlaTyr 

3244 AAGGAGGGCT ACGGCGAGGG CTGCGTGACC ATCCACGAGA TCGAGAACAA CACCGACGAG 
LysGluGly TyrGlyGlu GlyCysValThr IleHisGlu IleGluAsn AsnThrAspGlu 

3304 CTGAAGTTCA AGAACCGCGA GGAGGAGGAG GTGTACCCCA CCGACACCGG CACCTGCAAC 
LeuLysPhe LysAsnArg GluGluGluGlu ValTyrPro ThrAspThr GiyThrCysAsn 

3364 GACTACACCG CCCACCAGGG CACCGCCGGC 7GCGCCGACG CCTGCAACAG CCGCAACGCC 
AspTyrThr AlaJHisGln GlyThrAlaGly CysAlaAsp AlaCysAsn SerArgAsnAIa 

3424 GGCTACGAGG ACGCCTACGA GGTGGACACC ACCGCCAGCG TGAACTACAA GCCCACCTAC 
GlyTyrGlu AspAlaTyr GluValAspThr TlirAlaSer ValAsnTyr LysProThrTyr 

3484 GAGGAGGAGA CCTACACCGA CGTGCGCCGC GACAACCACT GCGAGTACGA CCGCGGC7AC 
GluGluGlu ThrTyrThr AspValArgArg AspAsnHis CysGluTyr AspArgGlyTyr 

354 4 GTGAAC7ACC CCCCCGTGCC CGCCGGC7AC GTGACCAAGG AGCTGGAGTA C77CCCCGAG 
ValAsnTyr ProProVal ProAlaGlyTyr ValThrLys GluLeuGlu TyrPheProGlu 

3604 ACCGACACCG TGTGGATCGA GATCGGCGAG ACCGAGGGCA AGTTCATCG7 GGACAGCGTG 
ThxAspThr ValTrpIle GluIleGlyGlu ThrGluGiy LysPhelle ValAspSerVal 

3664 GAGC7GCTGC TGA7GGAGGA G7AG 
GluLeuLeu LeuMetGlu Glu 



30/94 



WO 93/07278 PCT/US92/08476 



SEQUENCE OF THE FULL-LENGTH HYBRID SYNTHETIC/NATIVE ' CRYIA(B) CHIMERIC GENE 
Tne fusion point, between the synthetic and native coding sequences i3 
indicated by a slash (/) in the sequence. 

1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CATCAGCCTG 
ValGluVal LeuGlyGly GluAxglleGlu ThrGlyTyr ThrProIle AspIleSerLen 

121 AGCCTGACCC AGTTCCTGCT GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GGTGCAGATC 
ValAspile IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
oluGlnLeu IleAsnGln ArglleGluGlu PheAiaArg AsnGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATCTAC GCCGAGAGCT TCCGCSAGTG GGAGGCCGAC 
GluGlyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACAT GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGluGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 CTGACCACCG CCATCCCCCT GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThxThr AlallePro LeuPheAlaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

4 81 TACGTGCAGG CCGCCAACC7 GCACCTGAGC GTGCTGCGCG ACGTCAGCGT GTTCGGCCAG 
TyrVaiGln AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer ValPheGlyGln 

541 CGC7GGGGCT TCGACGCCGC CACCATCAAC AGCCGCTACA ACGACCTGAC CCGCCTGATC 
ArgTrpGly PheAspAla AlaThrlleAsn SerArgTyr AsnAspLeu ThxArgLeuIle 

SOI GGCAACTACA CCGACCACGC CGTGCGCTGG TACAACACCG GCCTGGAGCG CGTGTGGGGT 
GlyAsnTyr ThrAspHis AlaValArgTrp TyrAsnThr GlyLeuGlu ArgValTrpGly 

661 CCCGACAGCC GCGACTGGAT CAGGTACAAC CAGTTCCGCC GCGAGCTGAC CCTGACCGTG 
ProAspSer ArgAspTrp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVal 

721 C7GGACATCG TGAGCCTGTT CCCCAACTAC GACAGCCGCA CCTACCCCAT CCGCACCGTG 
LeuAspIle ValSerLeu PheProAsnTyr AspSerArg ThrTyrPro IleArgThxVal 

- 781 AGCCAGCTGA CCCGCGAGAT TTACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGCTTC 

SerGlnLeu ThrArgGlu IleTyrThrAsn ProValLeu GluAsnPhe AspGlySerPne 

- 841 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC ATCCGCAGCC CCCACCTGAT GGACATCCTG 

ArgGlySer AlaGlnGly IleGluGlySer IleArgSer PrcHisLeu MetAspIleLeu 

901 AACAGCATCA CCATCTACAC CGACGCCCAC CGCGGCGAGT ACTACTGGAG CGGCCACCAG 
AsnSerlle ThrlleTyr ThrAspAlaHis AxgGiyGlu TyrTyrTrp SerGlyHisGln 
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FIG . 7 (CONT; 



961 ATCATGGCCA GCCCCG7CGG CTTCAGCGGC CCCGAG77CA CC7TCCCCC7 GTACGGCACC 
IieMetAla SerProVal GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 

1021 ATGGGCAACG CTGCACC7CA GCAGCGCA7C G7GGCACAGC TGGGCCAGGG AG7G7ACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle VaLAlaGln LeuGlyGln GlyVal7yrArg 

1081 ACCC7GAGCA GCACCC7G7A CCG7CGACC7 77CAACA7CG GCATCAACAA CCAGCAGC7G 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCG7GC7GG ACGGCACCGA G77CGCC7AC GGCACCAGCA GCAACC7GCC CAGCGCCG7G 
SerValLeu AspGly7hr GluPheAlaTyr Gly7iirSer SerAsnLeu ProSerAlaVal 

1201 7ACCGCAAGA GCGGCACCG7 GGACAGCC7G GACGAGA7CC CCCC7CAGAA CAACAACGTG 
7yrArgLys SerGly71ir ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

1261 CCACCTCGAC AGGGC77CAG CCACCG7C7G AGCCACG7GA GCA7G77CCG CAG7GGC77C 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCG7GAGCA7 CA7CCG7GCA CC7ATG77CA GC7GGA77CA CCGCAG7GCC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe Ser7rplle HisArgSerAla 

1381 GAG77CAACA ACATCATCCC CAGCAGCCAG ATCACCCAGA TCCCCC7GAC CAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln IieThrGln IleProLeu 7hrLysSerThr 

14 41 AACC7GGGCA GCGGCACCAG CG7GG7GAAG GGCZCCGGCT 7CACCGGCGG CGACATCCTG 
AsnLeuGiy SerGlyThr SerValValLys GlyProGly Phe7hrGly GlyAspIleLeu 

1501 CGCCGCACCA GCCCCGGCCA GA7CAGCACC C7GCGCG7GA ACA7CACCGC CCCCC7GAGC 
ArgArgThr SerProGly GlnlleSerThr LeuArgVal Asnlle7hr AlaProLeuSer 

1561 CAGCGC7ACC GCG7CCGCA7 CCGC7ACGCC AGCACCACCA ACC7GCAG77 CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAla Ser7hrThr AsnLeuGln PheHis7hrSer 

1621 A7CGACGGCC GCCCCA7CAA CCAGGGCAAC 77CAGCGCCA CCATGAGCAG CGGCAGCAAC 
IleAspGly ArgProIle AsnGlnGlyAsn PheSerAla ThrMetSer SerGlySerAsn 

1681 C7GCAGAGCG GCAGC7TCCG CACCG7GGGC 77CACCACCC CCT7CAAC77 CAGCAACGGC 
LeuGlnSer GlySerPhe ArgThrValGly Phe7hrThr ProPheAsn PheSerAsnGly 

; 1741 AGCAGCG7G7 7CACCC7GAG CGCCCACG7G 77CAACAGCG GCAACGAGG7 G7ACA7CGAC 
SerSerVal Phe7hrLeu SerAlaflisVal PheAsnSer GiyAsnGlu Val7yrIleAsp 

^ 1801 CGCATCGAG7 7CG7GCCCGC CGAGG7GACC 77CGAGGCCG AG7ACGACC7 GGAGAGGGCT 
ArglleGlu PheValPro AlaGluVal7hr PheGluAla Glu7yrAsp LeuGluArgAla 

18 61 CAGAAGGCCG TGAACGAGC7 G77CACCAGC AGCAACCAGA 7CGGCC7GAA GACCGACG7G 
GlnLysAla VaLAsnGlu LeuPhe7hrSer SerAsnGln IleGlyLeu Lys7hrAspVal 
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1921 ACCGACTACC ACATCGAT/CA AGTATCCAAT TTAGTTGAGT GTTTATCTGATGAATT7TGT 
ThrAspTyr HisIleAsp/GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 CTGGA7GAAA AAAAAGAATT GTCCGAGAAA GTCAAACATG CGAAGCGACT TAGTGATGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 

2041 CGGAATTTAC TTCAAGATCC AAACTTTAGA GGGATCAATA GACAACTAGA CCGTGGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArgGlyTrp 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGTT 
ArgGlySer ThrAspIle ThrlleGlnGly GlyAspAsp ValPheLys GluAsnTyrVal 

2161 ACGCTATTGG G7ACCTTTGA TGAG7GCTAT CCAACG7A77 7A7A7CAAAA AA7AGATGAG 
ThrLeuLeu Gly7hrPhe AspGluCysTyr Pro7hrTyr LeuTyrGln LysIleAspGlu 

2221 TCGAAATTAA AAGCCTA7AC CCG77ACCAA 77AAGAGGG7 A7A7CGAAGA TAGTCAAGAC 
SerLysLeu LysAlaTyr 7hrArgTyrGln LeuArgGiy 7yrIleGlu AspSerGlnAsp 

2281 TTAGAAATCT A777AA77CG C7ACAA7GCC AAACACGAAA CAG7AAATG7 GCCAGG7ACG- 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGlu ThrValAsn ValProGlyThr 

2341 GG77CCTTAT GGCCGCT7TC AGCCCCAAG7 CCAA7CGGAA AA7GTGCCCA TCATTCCCAT 
GlySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysAla HisHisSerHis 

24 01 CATTTCTCCT TGGACATTGA TGTTGGATGT ACAGACTTAA ATGAGGACTT AGGTGTATGG 
rfisPheSer LeuAspIle AspValGlyCys ThrAspLeu AsnGluAsp LeuGiyVal7rp 

24 61 G7GA7A7TCA AGATTAAGAC GCAAGATGGC CATGCAAGAC TAGGAAATCT AGAATTTCTC 
ValllePhe LysIleLys ThrGlnAspGly HisAlaArg LeuGlyAsn LeuGluPheLeu 

2521 GAAGAGAAAC CATTAGTAGG AGAAGCACTA GCTCGTGTGA AAAGAGCGGA GAAAAAATGG 
GluGluLys ProLeuVal GlyGluAlaLeu AlaArgVal LysArgAla GluLysLysTrp 

2581 AGAGACAAAC GTGAAAAATT GGAATGGGAA ACAAATATTG TTTATAAAGA GGCAAAAGAA 
ArgAspLys ArgGluLys LeuGluTrpGlu ThrAsnlle ValTyrLys GluAlaLysGlu 

2641 TCTGTAGATG CTTTATTTGT AAACTCTCAA TATGATAGAT TACAAGCGGA TACCAACATC 
SerVaLAsp AlaLeuPhe ValAsnSerGln TyrAspArg LeuGlnAla Asp ThrAsnlle 

2701 GCGATGATTC ATGCGGCAGA TAAACGCGTT CATAGCATTC GAGAAGCTTA TCTGCCTGAG 
AlaMetlle HisAlaAla AspLysArgVal HisSerlle ArgGluAla TyrLeuProGlu 

2761 CTGTCTGTGA TTCCGGGTGT CAATGCGGCT ATTTTTGAAG AATTAGAAGG GCGTATTTTC 
LeuSerVal IleProGly ValAsnAlaAla IlePheGlu GluLeuGlu GlyArgllePhe 

2821 ACTGCATTCT CCCTATATGA TGCGAGAAAT G7CATTAAAA ATGGTGATTT taatAATGGC 
ThrAlaPhe SerLeuTyr AspAlaArgAsn VallleLys AsnGlyAsp PheAsnAsnGly 
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2881 TTATCCTGC7 GGAACG7GAA AGGGCATGTA GATGTAGAAG AACAAAACAA CCACCG77CG 
LeuSerCys TrpAsnVal LysGlyHisVal AspValGlu GluGlnAsn AsnflisArgSer 

2941 G7CCTTGTTG TTCCGGAATG GGAAGCAGAA GTGTCACAAG AAG77CG7G7 CTGTCCGGGT 
ValLeuVal ValProGlu TrpGluAlaGlu ValSerGln GluValArg ValCysProGly 

3001 CGTGGCTATA 7CC77CG7G7 CACAGCGTAC AAGGAGGGAT ATGGAGAAGG TTGCG7AACC 
ArgGlyTyr IleLeuArg ValThxAlaTyr LysGluGly TyrGlyGlu GlyCysValThr 

3061 ATTCATGAGA TCGAGAACAA TACAGACGAA CTGAAGTTTA GCAAC7GTGT AGAAGAGGAA 
IleHisGlu IieGluAsn AsnThxAspGlu LeuLysPhe SerAsnCys ValGluGluGlu 

3121 G7A7A7CCAA ACAACACGG7 AACG7G7AA7 GA77A7AC7G CGAC7CAAGA AGAA7A7GAG 
Val7yrPro AsnAsnThr Val7hxCysAsn AspTyrThr Ala7hrGln GluGluTyrGlu 

3181 GG7ACG7ACA C77C7CG7AA 7CGAGGA7A7 GACGGAGCC7 A7GAAAGCAA 77C77C7G7A 
Gly7hrTyr 7hrSerArg AsnArgGlyTyr AspGlyAla TyrGluSer AsnSerSerVal 

3241 CCAGC7GA77 ATGCA7CAGC C7A7GAAGAA AAAGCATA7A CAGA7GGACG AAGAGACAAT 
ProAlaAsp 7yrAlaSer A2a7yrGluGlu LysAla7yr 7hrAspGly ArgArgAspAsn 

3301 CC77G7GAA7 C7AACAGAGG A7A7GGGGA7 7ACACACCAC 7ACCAGC7GG C7A7G7GACA 
ProCysGlu SerAsnArg Gly7yrGlyAsp 7yr7hrPro LeuProAla Gly7yrVal7hr 

3361 AAAGAA77AG AG7AC77CCC AGAAACCGA7 AAGG7A7GGA T7GAGA7CGG AGAAACGGAA 
LysGluLeu GluTyrPhe ProGlu7hrAsp LysValTrp IleGluIIe GlyGluThrGlu 

3421 GGAACA77CA 7CG7GGACAG CG7GGAA77A C77C77A7GG AGGAA7AA 
GlyThrPhe IleValAsp SerValGluJLeu LeuLeuMet GluGlu 
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1 A7GGACAACA ACCCCAACAT CAACGAG7GC A7CCCC7ACA- ACTGCCTGAG CAACCCCGAG 
MecAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGCTACA CCCCCATCGA CA7CAGCC7G 
ValGluVal LeuGlyGly GluArglleGlu ThrGlyTyr ThrProIle AspIieSerLeu 

121 AGCC7GACCC AG7TCC7GC7 GAGCGAGTTC GTGCCCGGCG CCGGC7TCGT GC7GGGCC7G 
SerLeuThr GlnPheLeu LeuSerGluPhe ValProGly AlaGiyP.he ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT CTTCGGCCCC AGCCAGTGGG ACGCCTTCCT GG7GCAGA7C 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuVaiGlnlie 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG 7TCGCCCGCA ACCAGGCCA7 CAGCC3CCTS 
GiuGinLeu IleAsnGln ArglieGluGiu PheAiaArg AsnGinAla IleSerArgLeu 

3C1 GAGGGCC7GA GCAACC7G7A CCAAA7C7AC GCCGAGAGC7 TCCGCGAG7G GGAGGCCGA: 
GiuGiyLeu SerAsnLeu 7yrGlnIleTyr AlaGluSer PheArgGlu 7r?GluAlaAs? 

361 CCCACCAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT 7CAACGACA7 GAACAGCGCZ 
ProThrAsn ProAlaLeu ArgGiuGluMet ArglleGln PheAsnAsp MetAsnSerAla 

421 C7GACCACCG CCA7CCCCC7 G77CGCCG7G CAGAAC7ACC AGG7GCCCC7 GC7GAGCG7G 
Leu7hr7hr AlallePro LeuPheAlaVai GlnAsnTyr GlnVaiPro LeuLeuSerVal 

4 81 TACG7GCAGG CCGCCAACC7 GCACC7GAGC G7GC7GCGCG ACG7CAGCG7 G77CGGCCA3 
7yrValGlr. AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer VaiPheGiyGl.-. 

341 CGC7GGGGC7 TCGACGCCGC CACCA7CAAC AGCCGC7ACA ACGACC7GAC CC3CC7GA7C 
ArgTrpGly PheAspAla Ala7hrIleAsn SerArgTyr AsnAspLeu ThrArgLeuIle 

601 GGCAAC7ACA CCGACCACGC CG7GCGC7GG 7ACAACACCG GCC7GGAGCG CGTG7GGGG7 
GlyAsn7yr 7hrAspHis AIaValArg7rp 7yrAsn7hr GlyLauGlu ArgValTrpGly 

661 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCG7G 
ProAspSer ArgAsp7rp IleArg7yrAsn GlnPheArg ArgGluLeu 7hrLeu7hrVal 

721 C7GGACA7CG 7GAGCC7G77 CCCCAAC7AC GACAGCCGCA CC7ACCCCA7 CCGCACCG7G 
LeuAspIle ValSerLeu PheProAsn7yr AspSerArg 7hr7yrP.ro IieArg7hrVal 

781 AGCCAGC7GA CCCGCGAGA7 T7ACACCAAC CCCG7GC7GG AGAAC77CGA CGGCAGC77C 
- SerGlnLeu 7hrArgGlu Ile7yr7hrAsn ProValLeu GluAsnPhe AspGiySerPhe 

841 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC A7CCGCAGCC CCCACC7GA7 GGACA7CC7G 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MetAspIleLeu 

901 AACAGCA7CA CCA7C7ACAC CGACGCCCAC . CGCGGCGAG7 AC7AC7GGAG CGGGCACCAG 
AsnSerlle 7hrlle7yr 7hrAspAlaHis ArgGlyGlu 7yr7yr7rp SerGlyHisGln 

961 ATCATGGCCA GCCCCGTCGG CTTCAGCGGC CCCGAGTTCA CCTTCCCCCT GTACGGCACC 
IleMetAla SerProVai GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 
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1C21 A7GGGCAACG C7GCACC7CA GCAGCGCA7C G7GGCACAGC TGGGCCAGGG AG7G7ACCC 
MetGlyAsn AiaAlaPro GlnGlnArglle ValAlaGin LeuGlyGIn GiyVaiTyrArg 

ICS! ACCCTGAGCA GCACCC7GTA CCG7CGACC7 7TCAACA7CG GCA7CAACAA CCAGCAGC*- 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnG^nLeu 

1141 AGCGTGCTGG ACGGCACCGA G7TCGCC7AC GGCACCAGCA GCAACCTGCC CAGCGCCG~G 
aerValLeu AspGlyThr GIuPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVai ' 

1201 TACCGCAAGA GCGGCACCG7 GGACAGCC7G GACGAGATCC CCCC7CAGAA CAACAACTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAs^SvI? 

12 61 CCACC7CGAC AGGGCTTCAG CCACCG7C7G AGCCACG7GA GCA7G77CCG CAGTGGCTTC 
ProProArg GlnGlyPhe- SerHisArgLeu SerHisVal SerMerPhe ArgSerGlyPhe 

1321 c^ C t GCA c GCGT ? AGCAT CATCC GTGCA.CCTATG77CA GC7GGA7TCA CCGCAG7GC* 
SerAsnSer SerVaiSer IlelleArgAla ProMetPhe SerTrpHe HisArgSerAil 

■■ GAGTTCAACA ACATCA7CCC CAGCAGCCAG ATCACCCAGA 7CCCCC7GAC CAAGAGCACT 
GiuPheAsn Asnllelle ProSerSerGln IleThrGin UeProLeu ThrLyssS^' 

1441 AACC7GGGCA GCGGCACCAG CG7GGTGAAG GGCCCCGGC7 7CACCGGCGG CGACATCC7G 
AsnLeuGly SerGlyThr SerValValLys GiyProGly PheThrGly GlyAspIleLeu 



^55r GC ^ CA . GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACATCACCGC CCCCC^GAGC 
«rg«r 5 Thr aerPrcGly GlalieSerThr LeuArgVal AsnlieThr AlaPraZiSsS 

1361 CAGCGCTACC GCG7CCGCA7 CCGC7ACGCC AGCACCACCA ACC7GCAG77 CGACACCAGC 
GlnArgTyr ArgVaiArg IleArgTyrAla SerThrThr AsnLeuGln Ph^isThrSer" 

1521 A ^5 AG r GCC , GC ^ CCA y CAA ^CAGGGCAAC 7TCAGCGCCA CCATGAGCAG CGGCAGCAAC 
^eAspGxy ArgProIle AsnGlnGlyAsn PheSerAia ThrMetSer SerGlyterAsn 

1631 C7GCAGAGCG. GCAGC7TCCG CACCG7GGGC 77CACCACCC CCTTCAAC7 m CAGCAACGGC 
neuGlnSer GlySerPhe ArgThrValGly PheThrThr PrcPheAsn PheSe^As^y 

1741 terSeSI? 7 ^ 0 ??? 7 ^ CGCCCACGTG TTCAACAGCG GCAACGAGG7 G7ACATCGAC 
SerSerVal PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlleAsp 

1301 CGCATCGAG7 TCGTGCCCGC CGAGG7GACC TTCGAGGCCG AGTACGACCT GGAGAGGGC T 
Arg^leGlu PheVaiPro AlaGluValThr PheGluAla GluTyrAsp LeuGluArgAla 

1861 CAGAAGGCCG TGAACGAGC7 G77CACCAGC AGCAACCAGA TCGGCC7GAA GACCGACGTG 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 ACCGACTACC ACATCGATCA AG7ATCCAA7 TTAG7TGAG7 GT77ATCTGA TGAAT^TTGT 
ThrAspTyr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 CTGGATGAAA AAAAAGAATT G7CCGAGAAA G7CAAACATG CGAAGCGAC7 TAGTGATGAG 
LeuAspGlu LysLysGlu LeuSerGIuLys ValLysHis AlaLysArg LeuSerAspGlu 
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2C41 CGGAATTTAC 7TCAAGA7CC AAAC777AGA GGGATCAA7A GACAAC7AGA CCG7GGCTGG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GiylleAsn ArgGlnLeu AspArgGlyTrp 

2101 AGAGGAAGTA CGGATAT7AC CATCCAAGGA GGCGATGACG TA7TCAAAGA GAA77ACG77 
ArgGlySer 7hrAspIle 7hrIleGinGiy GlyAspAsp ValPheLys GiuAsn7yrVai 

2161 ACGC7A77GG G7ACC77CGA CGAG7GC7AC CCCACC7ACC 7G7ACCAGAA GA7CGACGAG 
ThrLeuLeu Gly7hr?he AspGluCys7yr ?ro7hr7yr Leu7yrGln LysIleAspGIu 

2221 AGCAAGC7GA AGGCC7ACAC CCGC7ACCAG C7GCGCGGC7 ACA7CGAGGA CAGCCAGGAC 
SerLysLeu LysAla7yr 7hrArg7yrGln LeuArgGly 7yrIleGlu AspSerGlnAsp 

22 31 C7GGAAA7C7 ACC7GA7CCG C7ACAACGCC AAGCACGAGA CCG7GAACG7 GCCCGGCACC 
LeuGluIle 7yrLeuIIe Arg7yrAsnAla LysHisGlu 7hrVaiAsn ValProGIy7hr 

2 2 m1 GGCAGCC7G7 GGCCCC7GAG CGCCCCCAGC CCCA7CGGCA AG7GCGGGGA GCCGAA7CGA 
GiySerLeu 7r?ProLeu SerAlaProSer ProIleGly LysCysGiy GluProAsnArc 

24 21 TGCGC7CCGC ACC7GGAG7G GAACCCGGAC C7AGAC7GCA GC7GCAGGGA CGGGGAGAAG 
CysAIaPro HisLeuGiu 7rpAsnProAsp LeuAspCys SerCysArg AspGiyGIuLys 

24 61 TGCGCCCACC ACAGCCACCA C77CAGCC7G GACA7CGACG 7GGGC7GCAC CGACC7GAAC 

CysAlaHis HisSerHis HisPheSerLeu AspIleAsp ValGlyCys 7hrAspLeuAsr. 

2521 GAGGACC7GG GCG7G7GGG7 GA7C77CAAG A7CAAGACCC AGGACGGCCA CGCCCGCC7G 
GluAspLau GlyValTrp ValllePheLys I leLy sThr GInAspGly HisAlaArgLe. 

25 31 GGCAA7C7AG AA777C7CGA AGAGAAACCA 77AG7AGGAG AAGCAC7AGC 7CG7G7GAAA 

GlyAsnLeu GluPheLeu GluGluLysPro LeuVaiGly GluAlaLeu AlaArgValLys 

2641 AGAGCGGAGA AAAAA7GGAG AGACAAACG7 GAAAAA77GG AA7GGGAAAC AAA7A77G77 
ArgAIaGlu LysLys7rp ArgAspLysArg GluLysLeu Giu7rpGlu 7hrAsnIieVal 

2701 7A7AAAGAGG CAAAAGAA7C 7G7AGA7GC7 77A777G7AA AC7C7CAA7A 7GA7AGA77A 
7yrLysGlu AlaLysGlu SerValAspAia LeuPheVal AsnSerGin 7yrAspArgLeu 

2 7 61 CAAGCGGA7A CCAACA7CGC GA7GA77CA7 GCGGCAGA7A AACGCG77CA 7AGCA77CGA 
GlnAlaAsp 7hrAsnIle AlaMetlleHis AlaAlaAsp LysArgVal HisSerlieArg 

,2321 GAAGC77A7C 7GCC7GAGC7 G7C7G7GA77 CCGGG7G7CA A7GCGGC7A7 7777GAAGAA 
GluAla7yr LeuProGlu LeuSerVallle ProGiyVal AsnAlaAla IlePheGluGlu 

2 881 77AGAAGGGC G7A7777CAC 7GCA77C7CC C7A7A7GA7G CGAGAAA7G7 CA77AAAAAT 
LeuGluGly ArgllePhe 7hrAlaPheSer Leu7yrAsp AlaArgAsn VailleLysAsn 

2941 GG7GA7777A A7AA7GGC77 A7CC7GC7GG AACG7GAAAG GGCA7G7AGA 7G7AGAAGAA 
^ GlyAspPhe AsnAsnGly LeuSerCys7rp AsnValLys GlyHisVal AspValGluGlu 

3001 CAAAACAACC ACCG77CGG7 CC77G77G77 CCGGAA7GGG AAGCAGAAG7 G7CACAAGAA 
GinAsnAsn HisArgSer ValLeuValVal ProGlu7rp GiuAlaGiu ValSerGlnGlu 
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3061 S!T^S T ?V G7 o CG ?? TCG TGGC7ATA7C C77CG7G7CA CAGCGTACAA GGAGGGA7A" 
^alArgVal CysProGly ArgGlyTyrlle LeuArgVal ThrAiaTyr LysGiuGlyTyr 

3121 GGAGAAGG77 GCG7AACCA7 TCATGAGATC GAGAACAA7A CAGACGAAC7 GAAG77TAG" 
GiyGIuGiy CysVal7hr IleHisGluIla GluAsnAsn 7hrAspGiu LeuLysPheSer 

3181 AAC7G7G7AG AAGAGGAAG7 A7A7CCAAAC AACACGG7AA CG7G7AA7GA r TA m ACTGC" 
AsnCysVai GluGluGlu ValTyrProAsn AsnThrVal ThrCyaAsn AspT^rAla 

3241 AC7CAAGAAG AA7ATGAGGG TACGTACACT TCTCG7AATC GAGGA7ATGA CGGAGCC7A" 
7hrGlnGlu Glu7yrGlu Gly7hrTyr7hr SerArgAsn ArgGly7yr AspGiyAlaTyr 

3301 GAAAGCAA77 C77C7G7ACC AGC7GA77A7 GCA7CAGCC7 A7GAAGAAAA AGCA7A7ACA 
GxuSerAsn SerSerVal ProAlaAapTyr AlaSerAla 7yrGluGlu LysAlaTyrTh? 

33 61 GA7GGACGAA GAGACAA7CC 77G7GAA7C7 AACAGAGGA7 A7GGGGA77A CACACCAC-* 

AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly 7yrGlyA S p 7yr7hrProLeC 

34 21 CCAGCTGGCT^ATGTGACAAA AGAA77AGAG TAC77CCCAG AAACCGA7AA GG7ATGGA" 

ProAlaGly 7vrVal7hr LysGluLeuGlu TyrPhePro GluThrAsp LysValTrpIla 

3481 GAGA7CGGAG AAACGGAAGG AACA77CA7C G7GGACAGCG 7GGAATTAC7 7C77A7GGAG 
GxuIleGly GluThrGiu GlyThrPhelle VaiAspSer ValGluLeu LeuLeuMetGlu 
3541 GAA7AA 
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1 ATGGACAACA ACCCCAACA7 CAACGAGTGC A7CCCC7ACA AC7GCC7GAG CAACCCCGA" 
MetAspAsn AsnProAsn IleAsnGluCys IleProTyr AsnCysLeu SerAsnProGl j 

61 G7GGAGG7GC 7GGGCGGCGA GCGCATCGAG ACCGGC7ACA CCCCCATCGA CATCAGCC" 
VaiGluVal LeuGiyGly GluArglleGlu ThrGlyTyr ThrProIle AspIieSerLeu 

121 AGCC7GACCC AG77CC7GC7 GAGCGAGTTC GTGCCCGGCG CCGGCTTCGT GC7GGGCC" - 
SerLeuThr GlnPheLeu LeuSerGIuPhe VaiProGIy AlaGlyPhe VaiLeuGlyLeu 

131 GTGGACATCA 7C7GGGGCA7 CTTCGGCCCC AGCCAGTGGG ACGCCTTCC? GG7GCAGA" 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AspAlaPhe LeuVaiGlnlle 

241 GAGCAGCTGA TCAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCC"- 
GxuGxnLeu IleAsnGln ArglleGIuGlu PheAlaArg AsnGlnAia IleSerArgLe-I 

301 GAGGGCCTGA GCAACCTG7A CCAAA7C7AC GCCGAGAGC7 7CCGCGAG7G GGAGGCCGA" 
GiuGlyLeu SerAsnLeu TyrGlnlleTyr AiaGluSer PheArgGlu TrpGiuAiaAsr 

' 61 S CC ^S CAACC CCGCCCTGCG CGAGGAGATG CGCATCCAGT TCAACGACA7 GAACAGCGC* 
ProThrAsn ProAIaLeu ArgGiuGluMec ArglleGln PheAsnAsp MetAsnSerAlI 

421 CTGACCACCG CCATCCCCC7 GTTCGCCGTG CAGAACTACC AGGTGCCCCT GCTGAGCG"" 
^euThrThr AlaliePro LeuPheAiaVal GlnAsnTyr GlnValPro LeuLeuSerVal 

481 TACGTGCAGG CCGCCAACC7 GCACC7GAGC G7GCTGCGCG ACG7CAGCG7 G77CGGCCA ~ 
-yrVaiGin AlaAlaAsn LeuHisLeuSer ValLeuArg AspValSer VaiPheGlyGl:. 

341 CGC7GGGGC7 TCGACGCCGC CACCA7CAAC AGCCGC7ACA ACGACC7GAC CCGCC7GA~ 
Arg7r?Gly PheAspAla Ala7hrIleAsn SerArgTyr AsnAspLeu ThrArgLeulia 

501 GGCAAC7ACA CCGACCACGC CG7GCGC7GG 7ACAACACCG GCC7GGAGCG CG7G7GGGG" 
-lyAsnTyr 7hrAspHis AlaValArgTrp 7yrAsn7hr GlyLeuGiu ArgVal7rpGl>- 

661 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCGTG 
ProAspSer ArgAspTrp IleArgTyrAsn GlnPheArg ArgGluLeu 7hrLeu7hrVal 

721 C7GGACA7CG 7GAGCC7G77 CCCCAAC7AC GACAGCCGGA CCTACCCCAT CCGCACCG7^ 
LeuAspIie ValSerLeu PheProAsnTyr AspSerArg 7hrTyrPro IleArgThrVal 

7 81 AGCCAGC7GA CCCGCGAGA7 77ACACCAAC CCCG7GC7GG AGAAC77CGA CGGCAGCTTC 
SerGlnLeu ThrArgGlu IieTyrThrAsn ProVaileu GluAsnPhe AspGIySerPhe 

341 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC A7CCGCAGCC CCCACC7GA7 GGACATCCTG 
ArgG^ySer AlaGlnGly IleGluGlySer IleArgSer PrcHisLeu MetAspIIeLeu 

901 AACAGCA7CA CCA7C7ACAC CGACGCCCAC CGCGGCGAG7 AC7AC7GGAG CGGCCACCAG 
AsnSerlle 7hrIleTyr 7hrAspAiaHis ArgGiyGlu TyrTyrTrp SerGlyHisGln 

961 A7CA7GGCCA GCCCCG7CGG C77CAGCGGC CCCGAG77CA CC77CCCCC7 G7ACGGCAC" 
IleMetAla SerProVal GlyPheSerGly ProGluPhe 7hrPhePro LeuTyrGlyThr 
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1021 ATGGGCAACG C7GCACCTCA GCAGCGCATC G7GGCACAGC ■ TGGGCCAGGG AGTGTACCGC 
MetGlyAsn AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGIn GlyVaiTyrArg 

" 1081 ACCCTGAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGinGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GTTCGCCTAC GGCACCAGCA 'GCAACCTGCC CAGCGCCG^Z 
aerValLeu AspGlyThr GluPheAlaTyr GiyThrSer SerAsnLeu ProSerAlaVai 

1201 TACCGCAAGA GCGGCACCGT GGACAGCCTG GACGAGATCC CCCCTCAGAA CAACAACGTG 
TyrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

1261 CCACCTCGAC AGGGCTTCAG CCACCGTCTG AGCCACGTGA GCATG7TCCG CAG7GGC~"C 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVai SerMetPhe ArgSerGiyPhe 

1321 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CCTATGTTCA GCTGGA77CA CCGCAGTGC 
SerAsnSer SerValSer IlelleArgAla ProMetPhe SerTrpIIe HisArgSerAia" 

1551 GAG7TCAACA ACATCATCCC CAGCAGCC AG ATCACCCAGA TCCCCC7GAC CAAGAGCACC 
GiuPheAsn Asnllelie ProSerSerGin IleThrGln IleProLeu ThrLysSerThr 

14 41 AACC7GGGCA GCGGCACCAG CG7GG7GAAG GGCCCCGGC7 7CACCGGCGG CGACATCCTG 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGiy GlyAspIleLeu 

1501 -CGCCGCACCA GCCCCGGCCA GATCAGCACC CTGCGCGTGA ACA7CACCGC CCCCCTGAG'" 
ArgArgThr SerFroGiy GlnlleSerThr LeuArgVai Asnlie7hr AlaPrcLeuSer 

1561 CAGCGCTACC GCG7CCGCA7 CCGCTACGCC AGCACCACCA ACC7GCAGT7 CCACACCAG" 
GlnArgTyr ArgValArg IieArgTyrAia SerThrThr AsnLeuGin PheHisThrSer 

1621 ATCGACGGCC GCCCCA7CAA CCAGGGCAAC T7CAGCGCCA CCA7GAGCAG CGGCAGCAAC 
lieAspGly ArgProIle AsnGlnGlyAsn PheSerAla ThrMetSer SerGlySerAsn 

1631 CTGCAGAGCG GCAGC7TCCG CACCG7GGGC 77CACCACCC CC77CAAC7T CAGCAACGGC 
LeuGlnSer GlySerPhe ArgThrValGly PheThrThr ProPheAsn PheSerAsnGly 

1741 AGCAGCGTGT 7CACCCTGAG CGCCCACGTG 7TCAACAGCG GCAACGAGGT G7ACA7CGAC 
SerSerVai PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu VaiTyrlleAsp 

.1801 CGCATCGAGT 7CG7GCCCGC CGAGG7GACC T7CGAGGCCG AG7ACGACC7 GGAGAGGGC7 
ArglleGlu PheValPro AlaGluValThr PheGluAIa GluTyrAsp LeuGluArgAla 

1861 CAGAAGGCCG 7GAACGAGC7 GTTCACCAGC AGCAACCAGA TCGGCC7GAA GACCGACGTG 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGin IleGlyLeu LysThrAspVal 

1921 ACCGAC7ACC ACATCGATCA GGTGAGCAAC CTGG7GGAG7 GCTTAAGCGA CGAG7TCTGC 
ThrAspTyr HisIleAsp GlnValSerAsn LeuValGiu CysLeuSer AspGluPheCys 

1981 C7GGACGAGA AGAAGGAGCT GAGCGAGAAG GTGAAGCACG CCAAGCGCC7 GAGCGACGAG 
LeuAspGiu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 
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FIG. 11 (CCNT) 

2041 CGCAACC7GC 7GCAGGACCC CAACTTCCGC GGCA7CAACC GCCAGCTGGA CC^CGGCTG- 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GiylieAsn ArgGlnLa^p^Gly^p' 

2101 CGAGGCAGCA CCGATATCAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAAC" ? ACGT'" 
ArgGlySer ThrAspIle ThrlleGlnGly GiyAspAsp ValPheLys Gi^Ty^i 

2161 ACCC7GCTGG GCACCTTCGA CGAGTGCTAC CCCACCTACC TGTACCAGAA ir-r 
ThrLeuLeu GlyThrPhe AsoGiuCysTyr ProThrTyr LeuTyrG^ysli^pctu 

2221 AGCAAGCTGA AGGCCTACAC CCGCTACCAG C7GCGCGGCT ACATCGAGGA CAGCCAGGAC 
SerLysLeu LysAlaTyr ThrArgTyrGin LeuArgGly TyrlleGlu AspSerSiSsp" 

2281 CTGGAAATCT ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACGT GCCCGGCAC" 
LeuGxulie TyrLeuIle ArgTyrAsnAla LysHisGlu ThrVaiAsn Va??^^ 

G^Swf ?GT T GG f C ? TGAG CGCCCCCAGC CCCATCGGCA AGTGCGGGGA GCCGAATCGA 
GxySerLeu TrpProLeu SerAlaProSer ProIleGly LysCysGly GluProAsnA^ 

JysSip^ C HiSJ^t GT S GAACCC ^AC C7AGACTGCA GCTGCAGGGA CGGGGAGAAG 
CysAlaPro HisLeuGlu TrpAsnProAsp LeuAspCys SerCysArg AspGlyGluLys 

l^ C ^ C » AC ^ GC ^ CC ^ C7TCXGCCTG GACATCGACG TGGGCTGCAC CGACCTGAAC 
CysAlaHis HisSerHis HisPheSerLeu AspIieAsp ValGlyCys ThrAsp^uAsn 

2521 GAGGACCTGG GCGTGTGGGT GATCTTCAAG A7CAAGACCC AGGACGGCZA '"GC"CGCTC 
° iUAsp - eu ValliePheLys IlelysThr GlnAspGiy HisAIalrgleu 

2581 GGCAA7C7AG AA777C7CGA AGAGAAACCA 77AG7AGGAG AAGCAC7AGC 7CGT G TGAAA 
ulyAsnLeu GluPheLeu GluGluLysPro LeuVaiGly GluAiaLeu AlaA?gValLya* 

2641 J5a^ G r AGA - AAAAA J GGAG AGACAAAC - GAAAAA77GG AA7GGGAAAC AAA7AT7G7T 
ArgAxaGlu i-ysLys7rp ArgAspLysArg GiuLysLeu GluTrpGlu ThrAsnlleVal 

2701 7A7AAAGAGG CAAAAGAA7C 7G7AGA7GC7 77A777G7AA AC7C7CAA7A -GA-AraTTi 
TyrLysGlu AlaLysGlu SerValAspAla LeuPheVal A iS5e^ T^rAspA^™ 

2761 rw CGGATA ^5 CAACATCGC GAT GA77CA7 GCGGCAGA7A AACGCG77CA 7AGC*7TCGA 
GlnAiaAsp 7hrAsnIle AlaMetlleHis AiaAlaAsp LysArgVal HisSerlleSg 

2821 GAAGCTTA7C TGCCTGAGC7 G7C7G7GA77 CCGGG7GTCA A7GCGGC7A7 *T^GAAGAA 
GluAla7yr LeuProGlu LeuSerVailie ProGlyVal AanAlaAla LePheGluGli 

2881 77AGAAGGGC G7A7777CAC 7GCA77C7CC C7A7A7GA7G CGAGAAA7G7 CA^AAAAAT 
LeuGluGly ArgllePhe 7hrAlaPheSer LeuTyrAsp AlaArgAsn VailieLylto 

2 941 GG7GAT777A ATAATGGCTT A7CC7GC7GG AACG7GAAAG GGCA7GTAGA -GTAGAAGAA 
GlyAspPhe AsnAsnGly LeuSerCysTrp AsnValLys GlyHisVal LpValGiuGlu 

3001 CAAAACAACC ACCG77CGG7 CC77G77G77 CC3GAATGGG AAGCAGAAG7 G7CACAAGAA 
GlnAsnAsn HiaArgSer ValLeuValVal ProGluTr? GluAlaGiu VaiSerGinGlu 
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3061 GTTCGTG7CT GTCCGGGTCG TGGCTATATC 
ValArgVai CysProGly ArgGlyTyrlle 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC 
GlyGluGly CysValThr IleHisGluIle 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC 
AsnCysVal GluGluGlu ValTyrProAsn 

3241 ACTCAAGAAG AATATGAGGG TACGTACACT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr 

3 301 GAAAGCAATT CTTCTGTACC AGCTGATTAT 
GluSerAsn SerSerVal ProAlaAspTyr 

3 3 61 GATGGACGAA GAGACAATCC TTGTGAATCT 
AspGlyArg ArgAspAsn ProCysGiuSer 

3 421 CCAGCTGGCT ATGTGACAAA AGAATTAGAG 
ProAlaGly TyrValThr LysGluLeuGiu 

3481 GAGATCGGAG AAACGGAAGG AACATTCATC 
GluIleGly GluThrGlu GlyThrPhelle 

3541 GAATAA 



) 



CTTCGTGTCA: CAGCGTACAA GGAGGGATA7 
LeuArgVal ThrAlaTyr LysGIuGlyTyr 

GAGAACAA7A CAGACGAACT GAAG7TTAGC 
GiuAsnAsn 7hrAspGIu LeuLysPheSer 

AACACGG7AA CG7G7AATGA 77A7AC7GCG 
Asn7hrVal 7hrCysAsn AspTyrThrAla 

7C7CG7AA7C GAGGA7A7GA CGGAGCC7A7 
SerArgAsn ArgGly7yr AspGiyAla7yr 

GCA7CAGCC7 A7GAAGAAAA AGCATATACA 
AlaSerAia 7yrGluGlu LysAlaTyrThr 

AACAGAGGA7 A7GGGGA77A CACACCAC7A 
AsnArgGly 7yrGlyAsp TyrThrProLeu 

TAC77CCCAG AAACCGA7AA GG7A7GGA77 
7yr?he?ro GluThrAsp Ly sVaI7rpIie 

G7GGACAGCG 7GGAA77AC7 7C77A7GGAG 
ValAspSer ValGluLeu LeuLeuMetGlu 
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1 ATGGACAACA ACCCCAACAT CAACGAGTGC ATCCCCTACA ACTGCCTGAG CAACCCCGAG 
MetAspAsn AsnProAsn IleAsnGiuCys IleProTyr AsnCysLeu SerAsnProGlu 

61 GTGGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGC7ACA CCCCCATCGA CA7CAGCC7G 
ValGluVai LeuGlyGIy GluArgI leGl-j ThrGiyTyr ThrProlie AspIleSerLeu 

121 AGCCTGACCC AG77CC7GC7 GAGCGAGT7C G7GCCCGGCG CCGGC77CGT GC7GGGCC7G 
SerLeu7hr GlnPheLeu LeuSerGluPhe VaiProGly AlaGlyPhe VaiLeuGlyLeu 

181 G7GGACATCA TC7GGGGCA7 C77CGGCCCC AGCCAG7GGG ACGCCTTCC7 GG7GCAGA7C 
ValAspIle Ile7rpGiy IlePheGlyPro SerGlnTrp AspAlaPhe LeuValGinlle 

241 GAGCAGC7GA TCAACCAGCG CATCGAGGAG 77CGCCCGCA ACCAGGCCA7 CAGCGGCCTG 
GluGlnLeu IleAsnGln ArglleGluGiu PheAlaArg AsnGlnAia IleSerArgLeu 

301 GAGGGCC7GA GCAACC7G7A CCAAA7C7AC GCCGAGAGC7 TCCGCGAG7G GGAGGCCGAC 
GluGlyLeu SerAsnLeu 7yrGlnIleTyr AlaGluSer PheArgGlu TrpGluAlaAsp 

361 CCCACCAACC CCGCCC7GCG CGAGGAGA7G CGCATCCAG7 TCAACGACAT GAACAGCGCC 
ProThrAsn ProAlaLeu ArgGiuGluMet ArglieGln PheAsnAsp MecAsnSerAla 

421 C7GACCACCG CCA7CCCCC7 GTTCGCCG7G CAGAACTACC AGGTGCCCCT GCTGAGCGTG 
LeuThrThr AiallePro LeuPheAlaVal GlnAsn7yr GlnValPro LeuLeuSerVal 

481 TACG7GCAGG CCGCCAACC7 GCACC7GAGC G7GC7GCGCG ACG7CAGCG7 G77CGGCCAG 
TyrVaiGlr. AlaAlaAsn LeuKisLeuSer VaiLeuArg AspValSer VaiPheGIyGIr. 

541 CGC7GGGGC7 TCGACGCCGC CACCA7CAAC AGCCGC7ACA ACGACC7GAC CCGCC7GA7C 
Arg7rpGly PheAspAla Ala7hrIleAsn SerArg7yr AsnAspLeu ThrArgLeuIle 

601 GGCAAC7ACA CCGACCACGC CG7GCGC7GG 7ACAACACCG GCC7GGAGCG CG7G7GGGG7 
GiyAsn7yr 7hrAspHis AlaVaiArgTrp 7yrAsn7hr GlyLeuGIu ArgVal7rpGly 

661 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCG7G 
ProAspSer ArgAsp7rp IleArg7yrAsn GirsPheArg ArgGluLeu 7hrLeuThrVal 

"21 C7GGACA7CG 7GAGCC7G77 CCCCAAC7AC GACAGCCGCA CC7ACCCCA7 CCGCACCGTG 
LeuAspIle ValSerLeu PheProAsnTyr AspSerArg ThrTyrPro IleArgThrVal 

781 AGCCAGCTGA CCCGCGAGA7 77ACACCAAC CCCG7GC7GG AGAAC7TCGA CGGCAGC77C 
SerGInLeu 7hrArgGiu IleTyrThrAsn ProValLeu GluAsnPhe AspGlySerPhe 

341 CGCGGCAGCG CCCAGGGCAT CGAGGGCAGC A7CCGCAGCC CCCACCTGAT GGACA7CC7G 
ArgGlySer AlaGlnGly IleGluGlySer IleArgSer ProHisLeu MecAspIleLeu 

901 AACAGCA7CA CCATC7ACAC CGACGCCCAC CGCGGCGAG7 AC7ACTGGAG CGGCCACCAG 
AsnSerlie 7hrIleTyr ThrAspAlaHis ArgGlyGlu TyrTyrTrp SerGlyHisGln 

961 A7CA7GGCCA GCCCCGTCGG C77CAGCGGC CCCGAG77CA CC7TCCCCC? G7ACGGCACC 
IleMetAia SerProVal GlyPheSerGly ProGluPhe 7hrPhePro LeuTyrGIyThr 

46/94 



WO 93/07278 PCT/US92/08476 



FIG. 13 (CONT) 



1021 A7GGGCAACG CTGCACC7CA GCAGCGCATC G7GGCACAGC TGGGCCAGGG AG7G7ACCGC 
MetGlyAsn AlaAiaPro GinGlnArglie ValAlaGln LeuGlyGin GlyVal7yrArg 

-081 ACGCTGAGCA GCACCCTGTA CCG7CGACCT TTCAACATCG GCATCAACAA CCAGCAGCTG 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylleAsn AsnGlnGlnLeu 

1141 AGCGTGCTGG ACGGCACCGA GT7CGCC7AC GGCACCAGCA GCAACC7GCC CAGCGCCG""^ 
SerValLeu AspGlyThr GluPheAIa7yr GiyThrSer SerAsnLeu ProSerAlaVal 

1201 7ACCGCAAGA GCGGCACCG7 GGACAGCC7G GACGAGATCC CCCC7CAGAA CAACAACG7G 
7yrArgLys SerGlyThr ValAspSerLeu AspGluIle ProProGln AsnAsnAsnVal 

12 61 CCACC7CGAC AGGGC77CAG CCACCG7C7G AGCCACG7GA GCA7G77CCG CAG7GGCTT" 

ProProArg GlnGlyPhe SerHisArgLeu SerHisVal Ser.MetPhe ArgSerGlyPhe 

1321 AGCAACAGCA GCG7GAGCA7 CA7CCG7GCA CC7A7G77CA GC7GGATTCA CCGCAGTGC 
SerAsnSer SerValSer IlelieArgAla ProMetPhe Ser7rplle HisAraSerAIa 

1361 GAG77CAACA ACA7CA7CCC CAGCAGCCAG ATCACCCAGA 7CCCCCTGAC CAAGAGCACC 
GluPheAsn Asnllelle ProSerSerGln IleThrGin IleProLeu ThrLysSerThr- 

1441 AACC7GGGCA GCGGCACCAG CG7GG7GAAG GGCCCCGGC7 TCACCGGCGG CGACATCC7G* 
AsnLeuGly SerGlyThr SerValValLys GiyProGly PheThrGly GlyAspIleLeir 

1501 CGCCGCACCA GCCCCGGCCA GA7CAGCACC C7GCGCGTGA ACA7CACCGC CCCCC7GAG" 
ArgArgThr SerProGly Gln:ieSer7hr LeuArgVal AsnlleThr AiaPrcLeuSer- 

15cl CAGCGC7ACC GCG7CCGCA7 CCGC7ACGCC AGCACCACCA ACC7GCAG77 CCACACCAGC 
GlnArgTyr ArgValArg IleArgTyrAla SerThrThr AsnLeuGln PheHisThrSer 

1621 ATCGACGGCC GCCCCATCAA CCAGGGCAAC 7TCAGCGCCA CCA7GAGCAG CGGCAGCAAC 
I^eAspGly ArgProIle AsnGlnGiyAsn PheSerAia ThrMetSer SerGiySerAsn 

1631 CTGCAGAGCG GCAGCTTCCG CACCG7GGGC T7CACCACCC CC77CAAC77 CAGCAACGGC 
i-euGlnSer GiySerPhe ArgThrValGly Phe7hr7hr ProPheAsn PheSerAsnGiy 

1741 AGCAGCG7G7 7CACCC7GAG CGCCCACG7G 77CAACAGCG GCAACGAGG7 G7ACA7CGAC 
SerSerVal PheThrLeu SerAlaHisVal PheAsnSer GiyAsnGiu Val7yrIieAsp 

,1301 CGCA7CGAG7 7CG7GCCCGC CGAGG7GACC T7CGAGGCCG AG7ACGACC7 GGAGAGGGC7 
ArglleGlu PheValPro AlaGluVai7hr PheGluAla GluTyrAsp LeuGiuArgAla 

13 61 CAGAAGGCCG 7GAACGAGC7 G77CACCAGC AGC AAC CAG A TCGGCC7GAA GACGGACGTG 

GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 ACCGAC7ACC ACATCGATCA GG7GAGCAAC CTGG7GGAG7 GC7TAAGCGA CGAG77CTGC 
ThrAspTyr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGiuPheCys 

1981 CTGGACGAGA AGAAGGAGC7 GAGCGAGAAG GTGAAGCACG CCAAGCGCC7 GAGCGACGAG 
LeuAspGlu LysLysGlu LeuSerGluLys ValLysHis AlaLysArg LeuSerAspGlu 
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2041 



CGCAACCTGC TGCAGGACCC CAACTTCCGC GGCA7CAACC GCCAGC7GGA CCGCGGC7GG 
ArgAsnLeu LeuGlnAsp ProAsnPheArg GiylleAsn ArgGinLeu AspArgGIyTr^ 

2101 CGAGGCAGCA CCGATATCAC CATCCAGGGC GGCGACGACG TGTTCAAGGA GAAC"*ACG""" 
ArgG^ySer ThrAspIle ThrlleGlnGiy GiyAspAsp ValPheLys GiuAsnTyrVal 

2161 ACCC7GC7GG GCACC7TCGA CGAGTGCTAC CCCACC7ACC TGTACCAGAA GATCGACGA" 
ThrLeuLeu GlyThrPhe AspGiuCysTyr ProThrTyr LeuTyrGln LysIleAspGlu 

2221 AGCAAGC7GA AGGCCTACAC CCGCTACCAG CTGCGCGGCT ACATCGAGGA CAGCCAGGA" 
SerLysLeu LysAlaTyr ThrArgTyrGln LeuArgGiy TyrlleGlu AspSerGlnAs? 

2281 C7GGAAA7C7 ACCTGATCCG CTACAACGCC AAGCACGAGA CCGTGAACG7 GCCC5GCAC 
LeuGluIle TyrLeuIle ArgTyrAsnAla LysHisGiu ThrVaiAsn ValProGlyTfcr 

2341 GGCAGCC7G7 GGCCCC7GAG CGCCCCCAGC CCCATCGGCA AG7GCGGGGA GCCGAATCG* 
G.ySerLeu 7rpProLeu SerAlaProSer ProIleGly LysCysGly GIuProAsnArJ 

2401 rvS? T p CGC . AC f TGG ^ GTG GAACC --AC CTAGAC7GCA GC7GCAGGGA CGGGGAGAAG 
CysAlaPro riisLeuGlu 7r P AsnProA S p LeuAspCys SerCysArg AspGiyGluLys 

2461 ?v^fStf H AC t GC : ACCA - C7TCAGCC?G 2ACA7CGACG 7GGGC7GCAC CGACC7GAAC 
CysAlaHis riisSerHis HisPheSerLeu AspIleAsp ValGlyCys 7hrAsp£euAtn 

GAGGACC7GG GCG7G7GGG7 GA7C77CAAG A7CAAGACCC AGGACGGCCA CGCCCGCC^ 
^uAsp-eu G,yVal7r? VaillePheLys :ieLys7hr GlnAspGly HisAlaArgLeu 

GGCAA7C7AG AG77CC7GGA GGAGAAGCCC C7GG7GGGCG AGGCCC7GGC CCGCG7GAA" 
GxyAsnLeu GxuPheLeu GluGluLysPro LeuValGly GluAlaLeu AlaArgValLys 

2641 a rS?:?^! AGAAG i GGCG CGACAAGCGC GAGAAGC7GG • AG7GGGAGAC CAACATCG7G 
ArgAlaGxu LysLys7rp ArgAspLysArg GiuLysLeu Glu7rpGlu ThrASnliaVal 

2701 S^rt^^^^S CG7GGACGCC C7G77CG7GA ACAGCCAG7A CGACCGCC7G 
7yrLysGlu AlaLysGlu SerValAspAla LeuPheVal AsnSerGln 7yrAspArgLeu 

2761 CAGGCCGACA CCAACATCGC CA7GA7CCAC GCCGCCGACA AGCGCG7GCA CAGCA77CGC 
GlnAlaAsp 7hrAsnIle AlaMetll.eHis AlaAlaAsp LysArgVal HisSerlleArg 

2821 GAGGCCJACC TGCCCGAGCT GAGCG7GA7C CCCGGCG7GA ACGCCGCCA7 C77CGAGGAA 
GluAla?yr LeuProGiu LeuSerVallie ProGiyVai AsnAlaAia IlePheGluGlu 

2881 C7CGAGGGCC GCA7C77CAC CGCC77CAGC C7G7ACGACG CCCGCAACG7 GA7CAAGAAC 
LeuGluGly ArgllePhe 7hrAlaPheSer LeuTyrAsp AlaArgAsn VallleLysAsn 

2941 GGCGAC77CA ACAACGGCC7 GAGC7GC7GG AACG7GAAGG GCCACG7GGA CG7GGAGGAG 
GxyAspPhe AsnAsnGly LeuSerCys7rp AsnValLys GlyHisVal AspValGluGlu 

3001 rtn^T 00 -^ 0 : 2 ^^ 01 GC?GGTGG - CCCGAG7GGG AGGCCGAGG7 GAGCCAGGAG 

GlnAsnAsn HisArgSer ValLeuValVai ProGluTrp GluAlaGlu ValSerGlnGlu ' 
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3061 
3121 
3181 
3241 
3301 



GTGCGCGTGT GCCCCGGCCG CGGC7ACATC C7GCGCG7GA CCGCC'ACAA rc*r~~rv*r 
VaxArgVal CysProGly ArgGiyTyrlie L-uArgVal^rXfri^y^SGl^ 

GGCGAGGGC7 GCGTGACCAT CCACGAGATC GAGAACAACA CCGACGAGC7 CAAGTTAr" 
GxyGluGxy CysValThr UaHisGluIle GiuAsnAsn ThrAspGIu LeuLysPheSer 

Atn?SSI? G rfrt G ?t GG , T GTACCCCA *C AACACCG7GA CCTGCAACGA C7ACACCGC* 
Asncysval GluGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTvrThrSa 

SS? C t GCC , GCGACAACCC CTGC ^GAGC AACCGCGGCT ACGGCGAC7A CACCCC-G 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly TyrGlyAsp TyrTh^P^S. 

ZCCGCCGGC7 ACGTGACCAA GGAGCTGGAG -\r—<-^r*~r : ir ^r-ir^ 

Pr=Ala Giy TyrValThr LysGluLeuGiu ^^'sJS^S^y^SS 



3421 
3481 
3541 GAG7AG 



Sn?^rf CG r^ G iJ C St GGG CACC ^ CA TC GTGGACAGCG 7GGAGC7GC7 GC7GA-GGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAspSer ValGluLeu leS^uM^S^ 
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1 ATGGACAACA ACCCCAACAT CAACGAGTGC A7CCCC7ACA ACTGCC7GAG CAACCCCGA3 
MetAspAsn AsnProAsn IleAsnGiuCys IlePrcTyr AsnCysLeu SerAsnProGlu 

61 G7GGAGG7GC 7GGGCGGCGA GCGCA7CGAG ACCGGC7ACA CCCCCATCGA CA7CAGCC73 
ValGluVal LeuGlyGIy GiuArglieGlu 7hrGly7yr 7hrProIIe AspIleSerLeu 

121 AGCC7GACCC AG77CC7GC7 GAGCGAG77C G7GCCCGGCG CCGGC77CG7 GC7GGGCC7G 
SerLeu7hr GinPheLeu LeuSerGluPhe ValProGiy AlaGlyPhe VaiLeuGlyLeu 

181 G7GGACA7CA 7C7GGGGCA7 C77CGGCCCC AGCCAG7GGG ACGCCT7CC7 GG7GCAGA7C 
ValAspIle Ile7rpGly IlePheGlyPro SerGln7rp AspAlaPhe LeuValGlnlle 

241 GAGCAGC7GA 7CAACCAGCG CA7CGAGGAG T7CGCCCGCA ACCAGGCCA7 CAGCCGCCTG 
GluGlnLeu IleAsnGln ArglieGluGIu PheAlaArg AsnGlnAla IleSerArgLe*- 

301 GAGGGCC7GA GCAACC7G7A CCAAA7C7AC GCCGAGAGC7 7CCGCGAG7G GGAGGCCGA: 
GluGIyLeu SerAsnLeu 7yrGlnIle7yr AlaGluSer PheArgGlu TrpGluAiaAsr 

3 61- CCCACCAACC CCGCCCTGCG CGAGGAGA7G CGCA7CCAG7 7CAACGACA7 GAACAGCGCI 
ProThrAsn ProAiaLeu ArgGluGIuMet ArglleGln PheAsnAsp MetAsnSerAla 

* 421 C7GACCACCG CCA7CCCCC7 G7TCGCCGTG CAGAAC7ACC AGG7GCCCC7 GC7GAGCG7G 
Leu7hrThr AiallePro LeuPheAlaVai GInAsnTyr GlnValPro LeuLeuSerVal 

481 7ACG7GCAGG CGGCCAACCT GCACZ7G^GC G7GC7GCGCG ACG7CAGCG7 GT7CGGCCA3 
-yrValGlr. AlaAlaAsn leuHi sleuSer ValleuArg AspValSer ValPheGlyGl" 

54 1 CGC7GGGGC7 7CGACGCCGC CACCA7CAAC AGCCGC7ACA ACGACC7GAC CCGCC7GA7C 
Arg7rpGly PheAspAla AIa7hrIleAsn SerArgTyr AsnAspLeu ThrArgLauIle 

501 GGCAAC7ACA CGGACCACGC CG7GCGC7GG 7ACAACACCG GCCTGGAGCG CG7G7GGGG7 
GlyAsn7yr 7hrAspHis . AIaVaIArg7rp TyrAsn7hr GlyLeuGlu ArgValTrpGiy 

661 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCG7G 
PrcAspSer ArgAspT.rp IleArgTyrAsn GlnPheArg ArgGluLeu ThrLeuThrVai 

721 C7GGACA7CG 7GAGCC7G77 CCCCAAC7AC GACAGCCGCA CC7ACCCCA7 CGGCACCG7G 
LeuAspIle ValSerLeu PheProAsn7yr AspSerArg 7hr7yrPro IleArg7hrVa: 

, p 781 AGCCAGC7GA CCCGCGAGA7 T7ACACCAAC CCCGTGCTGG AGAACTTCGA CGGCAGC77C 
SerGlnLeu 7hrArgGiu IleTyrThrAsn ProVaiLeu GIuAsnPhe AspGlySerPhe 

? 841 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC A7CCGCAGCC CCCACCTGA7 GGACA7CC7G 
ArgGlySer AlaGlnGiy ZleGluGiySer IleArgSer ProHisLeu MetAspIleLeu 

901 AACAGCA7CA CCA7C7ACAC CGACGCCCAC CGCGGCGAG7 AC7ACTGGAG CGGCCACCAG 
AsnSerlie 7hrlie7yr ThrAspAiaHis ArgGiyGlu TyrTyrTrp SerGlyHisGlxi 

961 A7CA7GGCCA GCCCCG7CGG C77CAGCGGC CCCGAGT7CA CC77CCCCC7 G7ACGGCACC 
IleMetAla SerProVal GlyPheSerGly ProGluPhe 7hrPhePro LeuTyrGly7hr 
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1021 A7GGGCAACG CTGCACC7CA GCAGCGCATC GTGGCACAGC TGGGCCAGGG AG7G7ACCGC 
MetGlyAsn AlaAlaPro GinGlnArglie ValAlaGln LeuGlyGin GiyVaiTyrArr 

1081 ACCC7GAGCA GCACCCTGTA CCGTCGACCT TTCAACATCG GCATCAACAA CCAGCAGC73 
ThrLeuSer SerThrLeu TyrArgArgPro PheAsnlle GlylieAsn AsnGlnGlr.Leu 

1141 < AGCG7GC7GG ACGGCACCGA G7TCGCC7AC GGCACCAGCA GCAACG7GCC CAGCGCCGTC 
SerValLeu AspGlyThr GluPheAlaTyr GlyThrSer SerAsnLeu ProSerAlaVal 

1201 TACCGCAAGA GCGGCACCG7 GGACAGCC7G GACGAGATCC CCCC7CAGAA CAACAACG7G 
TyrArgLys SerGlyThr ValAspSerLeu AspGiuIle ProProGln AsnAsnAsnVal 

.12 61 CCACCTCGAC ' AGGGCTTCAG CCACCG7C7G AGCCACG7GA GCA7G77CCG CAG7GGC77C 
ProProArg GlnGlyPhe SerHisArgLeu SerHisVal SerMecPhe • ArgSerGiyPhe 

1221 . AGCAACAGCA GCG7GAGCA7 CA7CCG7GCA CC7A7G7TCA GC7GGA77CA CCGCAG7GCC 
SerAsnSer SerValSer IlelieArgAla ProMecPhe Ser7rplle HisArgSerAla 

1351 GAG77CAACA ACA7CATCCC CAGCAGCCAG ATCACCCAGA 7CCCCC7GAC CAAGAGCACI 
GiuPheAsn Asnllelle Pr-oSerSerGin IleThrGin ZieProLeu 7hrLysSerThr 

1441 AACC7GGGCA GCGGCACCAG CG7GG7GAAG GGCCCCGGC7 TCACCGGCGG CGACA7CC7G 
AsnLeuGly SerGlyThr SerValValLys GlyProGly PheThrGly GlyAspIleLeu 

15C1 CGCCGCACCA GCCCCGGCCA GA7CAGCACC C7GCGCG7GA ACA7CACCGC CCCCC-GAG" 
ArrArgThr SerPrcGIy GIr.IleSerThr LeuArgVai AsnlleThr AlaFroLeuSer 

1561 CAGCGC7ACC GCG7CCGCA7 CCGC7ACGCC AGCACCACCA ACC7GCAG77 CCACACCAGC 
GInArgTyr ArgVaiArg IleArgTyrAla SerThrThr AsnLeuGIn PheHis7hrSer 

1621 ATCGACGGCC GCCCCA7CAA CCAGGGCAAC TTCAGCGCCA CCA7GAGCAG CGGCAGCAAC 
lieAspGly ArgProIle AsnGinGlyAsn PheSerAia ThrMetSer SerGiySerAsr. 

1681 C7GCAGAGCG GCAGC77CCG CACCG7GGGC 77CACCACCC CC77CAAC77 CAGCAACGGC 
LeuGlnSer GlySerPhe Arg7hrValGly PheThrThr ProPheAsn PheSerAsnGly 

1741 AGCAGCG7G7 7CACCC7GAG CGCCCACG7G 77CAACAGCG GCAACGAGG7 G7ACA7CGAC 
SerSerVal PheThrLeu SerAlaHisVal PheAsnSer GlyAsnGlu ValTyrlieAsp 

4 1801 CGCATCGAG7 7CG7GCCCGC CGAGG7GACC T7CGAGGCCG AG7ACGACC7 GGAGAGGGC7 
ArglleGlu PheValPro AlaGluVal7hr PheGluAia Glu7yrAsp LeuGluArgAla 

, -361 CAGAAGGCCG 7GAACGAGC7 G7TCACCAGC AGCAACCAGA TCGGCC7GAA GACCGACG7G 
GlnLysAla ValAsnGlu LeuPheThrSer SerAsnGln IleGlyLeu LysThrAspVal 

1921 ACCGACTACC ACATCGATCA AG7ATCCAA7 77AG77GAG7 G77TA7CTGA TGAATTTTG7 
ThrAspTyr HisIleAsp GlnValSerAsn LeuValGlu CysLeuSer AspGluPheCys 

1981 C7GGA7GAAA AAAAAGAA77 G7CCGAGAAA G7CAAACA7G CGAAGCGAC7 TAG7GATGAG 
LeuAspGiu LysLysGlu LeuSerGluLys ValLysHis AiaLysArg LeuSerAspGlu 
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2041 CGGAATTTAC 77CAAGA7CC AAAC777AGA GGGATCAATA' GACAAC7AGA CCG7GGC7GG 
ArgAsnLeu LeuGinAsp ProAsnPheArg GlylleAsn ArgGlnLeu AspArcGlyTro 

2101 AGAGGAAGTA CGGATATTAC CATCCAAGGA GGCGATGACG TATTCAAAGA GAATTACGT" 
ArgGlySer ThrAsplie ThrlleGlnGly GiyAspAsp VaiPheLys GiuAsnTyrVal 

2161 ACGCTATTGG GTACCTTTGA TGAGTGCTAT CCAACGTATT TATATCAAAA AATAGATGA-- 
ThrLeuLeu GiyThrPhe AspGluCysTyr ProThrTyr LeuTyrGln LysIieAspGlu 

2221 TCGAAATTAA AAGCCTATAC CCGTTACCAA T7AAGAGGGT A7A7CGAAGA TAGTCAAGAC 
SerLysLeu LysAiaTyr ThrArgTyrGin LeuArgGly TyrlleGlu AspSerGinAsp 

2281 TTAGAAATCT AT7TAATTCG CTACAATGCC AAACACGAAA CAGTAAATGT GCCAGGTACG 
LeuGluIle TyrLeuIle ArgTyrAsnAia LysHisGlu ThrValAsn ValProGlyThr 

2241 GG7TCCTTAT GGCCGCTTTC AGCCCCAAGT CCAATCGGCA AG7GCGGGGA GCCGAA7CGA 
GiySerLeu 7r?ProLeu SerAlaProSer ProIieGly LysCysGly GluProAsnArc 

2 401 7GCGC7CCGC ACC7GGAG7G GAACCCGGAC C7AGAC7GCA GC7GCAGGGA CGGGGAGAAG 
CysAiaPro HisLeuGlu 7rpAsnProAsp LeuAspCys SerCysArg AspGiyGluLys 

2 4 61 7GCGCCCACC ACAGCCACCA C77CAGCC7G GACA7CGACG 7GGGC7GCAC CGACC7GAAC 
CysAlaHis HisSerHis HisPheSerLeu Asp.IleAsp ValGlyCys ThrAspLeuAsn- 

2521 GAGGACC7GG GCG7G7GGG7 GA7C77CAAG A7CAAGACCC AGGACGGCCA CGCCCGCC7G 
.-•-AspLeu GiyValTrp Val : LePheLys Il.LysThr GInAspGly HisAlaArgLe* 

2 5 31 GGCAATC7AG AA777C7CGA AGAGAAACCA 77AG7AGGAG AAGCAC7AGC 7CG7G7GAAA 
GlyAsnLeu GluPheLeu GluGluLysPro LeuValGly GiuAlaLeu AlaArgValLys 

2 541 AGAGCGGAGA AAAAA7GGAG AGACAAACG7 GAAAAA77GG AA7GGGAAAC AAA7A77G7" 
ArgAlaGlu LysLysTrp ArgAspLysArg GluLysLeu Giu7rpGlu ThrAsnlieVal 

2 701 7A7AAAGAGG CAAAAGAA7C 7G7AGA7GC7 77A777G7AA AC7C7CAA7A 7GA7AGA7TA 
TyrLysGLu AiaLysGlu SerVaiAspAla LeuPheVal AsnSerGin TyrAspArgLeu 

27 61 CAAGCGGA7A CCAACA7CGC GA7GA77CA7 GCGGCAGATA AACGCG77CA 7AGCA7 T CGA 
binAlaAsp 7hrAsnIle AiaMetlleHis AiaAlaAsp LysArgVal HisSerlieArg 

2 821 GAAGC77A7C 7GCC7GAGC7 G7C7G7GA77 CCGGGTG7CA A7GCGGC7A7 7777GAAGAA 
Gj-uAlaTyr LeuProGIu LeuSerVaille ProGlyVai AsnAlaAia IlePheGluGlu 

2381 77AGAAGGGC G7A777TCAC 7GCA77C7CC C7A7A7GA7G CGAGAAA7G7 CA77AAAAA7 
LeuGluGiy ArgllePhe ThrAlaPheSer Leu7yrAsp AiaArgAsn VailieLysAsn 

2 941 GG7GA7777A A7AA7GGC77 A7CC7GC7GG AACG7GAAAG GGCA7G7AGA 7G7AGAAGAA 
GxyAspPhe AsnAsnGly LeuSerCysTrp AsnValLys GiyHisVal AspValGIuGlu 

3001 CAAAACAACC ACCG77CGG7 CC77G77G77 CCGGAA7GGG AAGCAGAAG7 G7CACAAGAA 
GinAsnAsn HisArgSer ValLeuValVal ProGlu7rp GIuAlaGlu VaiSerGlnGlu 
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30 61 GT7CGTGTCT GTCCGGGTCG 7GGC7A7A7C C77CG7G7CA CAGCGTACAA GGAGGGATA" 
ValArgVal CysProGly ArgGIyTyrlle LeuArgVal ThrAiaTyr LysGluGlyTyr 

3121 GGAGAAGG7? GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGly CysValThr IleHisGlulie GluAsnAsn ThrAspGlu LeuLysPheSer 

3181 AACTGTGTAG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA T7ATACTGCG 
AsnCysVal GiuGluGlu ValTyrProAsn AsnThrVal ThrCysAsn AspTyrThrAla 

32 41 ACTCAAGAAG AATATGAGGG 7ACGTACAC7 TCTCGTAATC GAGGA7ATGA CGGAGCC7A7 
7hrGlnGlu GluTyrGlu Gly7hrTyr7hr SerArgAsn ArgGlyTyr AspGlyAlaTyr 

3301 GAAAGCAA77 C77C7G7ACC AGC7GA77A7 GCA7CAGCC7 A7GAAGAAAA AGCA7A7ACA 
GluSerAsn SerSerVal ProAlaAspTyr AlaSerAla TyrGluGlu LysAlaTyr7hr 

3361 GA7GGACGAA GAGACAATCC 77G7GAA7C7 AACAGAGGA7 A7GGGGA77A CACACCAC7A 
AspGlyArg ArgAspAsn ProCysGluSer AsnArgGly 7yrGlyAsp 7yr7hrProLe- 

34 21 CCAGC7GGC7 A7G7GACAAA AGAA77AGAG 7AC77CCCAG AAACCGA7AA GG7A7GGA7T 
ProAlaGly 7yrVal7hr LysGluLeuGlu 7yrPhePro Glu7hrAsp Lysval7rplla 

34 81 GAGA7CGGAG AAACGGAAGG AACA77CA7C G7GGACAGCG 7GGAA77AC7 7C77A7GGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAspSer ValGluLeu LeuLeuMetGl-j 

3541 GAA7AAG 
G 1 u — — — 
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PCR fragment replacing Pith Nco I (ATG start) 

with Bam HI. Pith-S'-GGATCCAACA(ATG)-3 , -syn.-Bt 
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CrvlA(b) Protein Levels in Transgenic Maize 
ELISA 3t Values of Field Plants: 

INBRED X PARENT ABRU PLANT Number ng Bt/mg protein 



2ND01X171-4A 


1646 


29 


5N984X171-4A 


357 


1705 


5N984X171-4A 


870 


1760 


5N984X171-13 


969 


22 


5N984X171-15 


1468 


17 


5N984X171-15 


1470 


28 


5N984X171-14A 


1502 


180 


5N984X171-14A 


1529 


1500 


5N984X 176-11 


1667 


408 


5N984X176-11 


1671 


1270 


5N984X176-11 


1573 


1522 


5N984X 176-11 


1675 


943 


5N984X176-11 


1679 


967 


5N984X171-4B 


1942 


15 


5N984X171-48 


1946 


16 


5NA56X171-16ABX 


1101 


30 


5NA89X176-11 


1622 


959 


5NA89X176-11 


1630 


1 172 


5NA89X176-11 


1635 


1 100 


8F010X171-4 


325 


103 


6F010X171-4 


832 


1298 



-Bt levels are in ng crylA(ta).mg total proton 

-Data are from progeny of the described marzp tran^ormants exDressinq the civiA(b) 
protein. 

•ELISA analysis of transgenic plant material was carried out using standard procedures 
as described elsewhere. 
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FIG . 23C 

CrvlA(b) Protein Levels in Transgenic Maize 
Greenhouse plants 
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-Bt levels are in ng crylA(b)/mg total protein 

Data are from progeny of the described maize transformants expressing the crylA(b) 
protein. 

EL1SA analysis of transgenic plant material was earned out using standard procedures 
as desrnbed elsewhere. 
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FIG. 



23E 



EXPRESSION OF THE CRYIA(b) GENE IN TRANSGENIC MAIZE USING THE PITH- 
PREFERRED PROMOTER 

Leaf samples from small plantiets transformed with oCIB4433 using procedures 
described elsewhere were analyzed for the presence of the crylA(b) protein using 
ELISA. All plants expressing crylA(b) were found to be insecticidal in the standard 
European com borer bioassay. 

Note that the pith-preferred promoter has a low. but detectable level of expression in 
leaf tissue of maize. Detection of CryiA(b) protein 15 consistent with this pattern of 
expression. 



PLANT NUMBER 
JS21A-1 TOP 
JS21A-2TOP 
JS21A-3 TOP 
JS21A-11 TOP 
JS21A-12 TOP 
JS21A-13 TOP 
JS21A-14 TOP 
JS21A-19 TOP 
JS21A-24TOP 
JS21A-28 TOP 
JS22D-3 MID 
JS22D-4 MID 
JS22D-1 1 WO 
JS22D-17 MID 



ng cylA(b)/mg protein 
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FIG. 24 

1 GAATTCGGATCCATTAAAGAAGTCTTTGAACAGATTCTAGAGATCTAGTTTAATGAGCTC 60 

61 CCAAAAGTCTTGAAAAAATTCAGCGGGGAGGCCATTAGGGCAGGGGTACTGTTATGTTTT 120 

121 AAAGAGAACACCACTTTCTTGATCTCTTCTAAAGAGAAATGTTTTGTAAGAAGGATCCTG 180 

181 T C C T C C T CAT CGAAC CT TT TCATC G GCAAAT T T T T CAT AGAGAT AT TAGAGGCAAGAGAG 24 0 

241 GGGCCAAAAAGATCCATGTAAATGGAAGTGGCCACCTGGTTGATACC7CCCTCATCTTCA 300 

301 ACAGAAAATCCATTATGAAAAAGTGAATGGATTTTAAACTCTTCTTTTTCTTCCCTTTTG 360 

3 61 CAATGAGCTGAAAATATCTGGTATTATTCTCATCACCCTCATTAATGAATCTGTCCCTAG 420 

421 CAATTTGCTTTCTCTTGATCCCTTCTGCAGCCACCATGTTTCTTAAATTCCACTCCATAT 480 

481 CAAGCTTTTCC1AATCTATCAGAATCTGAGATGGCTGCAATCTCTCTCATTTTCTCAAGGA 540 

541 TATCGATGTTATCCATAAGGTATTTCTTGAACTTCTTATATTTCCCTTCGACATTTATAT 600 

601 TCCATCCTTTCAACATTTTTTTGTTCAATCTTTTTTGTTTTTTTCCTTTCCAAACATCGA 660 

6 61 TACATTTCCTGCTCCTCACAGGTAAGGACGAGCTTTCAAAAAACCTTCTGCTTTAAAGTC 720 
721 AGGTCTGAGCCTCCAGCAAAGCTCACATATCTAAAGTCCCTCTTCTTAGTTGGGACAGAG 780 

7 81 T CAGT GC T AAGACACAT GGGAACAT GAC CAGAAAAAAAAAAT CAT ATT T AGC C CAGAGAC 840 
841 AACAATATTCTTGTACTGCAAGTCTCGTTATGGGCTAGCAAAGGAATCTACCCAACTTCT 900 
901 CAAATGTGTTGGGATGTCAAGTATATAGACTATTCATCAGTTCCAACTCTATCAAACTGT 960 
961 GCAGCTCAATTATAGAGTTGAATAAAGTGCTCCATCTATTTGTTCTTATCCTCATATTTG 1020 

1021 GTTAAGATATTAAAATCACCTCCCACCAACATTTAAAGTGCACCATTTAAAGTGGCTCGC 1 0 8 C 

10 81 GAGCACCAAACCGCTGAAAACCGGAAATGTTTAGCACGTTGGCAGCGGGACCCTTTTCTA 1140 

1141 TCTCATCGTGTTCTTCGTTGTCCACCACGGCCCACGGGCCAACGCTCCTCCATCCTGTAG 1200 

1201 TGTAGAGTATATTCCATTTGCGACCGAGCCGAGCATCGATCCAGCCACACTGGCCACTGC , 1260 

84 

1261 CAGCCAGCCATGTGGCACTCCTACGTATACTACGTGA GGTGaGAT?^ 1320 

-465 -405 

1321 GGGACCGAGATATTTTAC7GCTGTGGTTGTGTGAGAGATAATAAAGCATTTATGACGATT 138C 

1381 GCTGAACAGCACACACCATGCGTCCAGATAGAGAAAGCTTTCTCTCTTTATTCGCATGCA 1440 

1441 TGTTTCATTATCTTTTATCATATATATATAACACATATTAAATGATTCTTCGTT CCAAT T 1500 

-285 -226 

1501 TATAATTCATTTGACTTTTTTATCCACCGATGCTCGTTTTATTAAAAAAAATATTATAAT 15 60 

-225 -166 

1561 TATTGTTACTTTTTGTTGTAATATTGTTTAGCA TATAATA AACTTTGATACTAGTATGTT 1620 

-165 -106 

49 

1621 TCCGAGCAAAAAAAAATATTAATATTTAGATTACGA GCCCATT^ 1680 

-105 -46 

83 +1 

1681 ACAAGC GAAGGAAAGCAAAGCAA GCTAATGTTGCCCCTGCTGTG C^ GCAGAGGC C C GCT 1740 

-45 +15 

1741 CTTGCTATAAACGAGGCAGCTAGACGCGACTCGACTCATCAGCCTCATCAACCT^A^^ 1800 

+16 +75 
*************** i 

1801 AGGAGGAACGAACGG AGAGGTTGTTGCACAGAAGCGACATGGCTTTCGC 18 60 

+ 76 MAFAPKTS +135 

38 
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FIG. 24 



18 61 CC7GCTCCTCCTCGCTGTCC 7CGGCG7TGCAGGCAGC7CAGTCSCCGCCGCTGCTCC7GA 192: 
+ 126 SSSSLSSALQAAQ-SPPLLLR rl95 

.40 + 41 

1921 GGCGGATGTCGTCGACCGCAACACCGAGACGGAGGTACGACGC GGCCGTCGTCGTCACTA 1 9 8 i 
- +196 RMS S7A7PRRRYDA ~ V V V f 7 +255 

1981 CCACCACCACTGCTAGAGCTGCGGCGGCTGCTGTCACGG7TCCCGCCGCCCCGCCGCAGG 204 C 
+256 T T 7ARAAAAAV7 VPAAP P Q A +315 

75 $ 
2041 CGGGCCGCCGCCGCCGG7GCCACCAAAGCAAGCGGC GGCACCCGC£IGAG^ 2100 
+ 316 GRRRRCHQSKRRHPQRRSRP +375 

2101 CGGTGTCGGACACCATGGCGGCGCTCATGGCCAAGGGCAAGGTTCGTATAGTACGCGCGC 2160 
+376 VSDTMAALMAKGK 

2161 G7G7CG7CG7CG77A7777GCGCA7AGGCGCGGACA7ACACG7GC777AGC7AGC7AACA 222 C 
2221 GCTAGATCATCGGTGCAGACGGCGTTCATCCCGTACATCACCGCCGGCGACCCGGACCTA 228! 

7AFIPYI7 A G D ? D L 

22 81 GCGACGACGGCCGAGGCGC7GCG7C7GC7GGACGGC7G7GGCGCCGACG7CA7CGAGC7G. 234 3 
A77AEALRLLDGC GADVIEL 

2341 GGGGTACCCTGCTCGGACCCCTACATCGACGGGCCCATCATCCAGGCGTCGGTGGCGCGG 2400 
GVPCSDPYIDGPI IQASVAR 

24 01 GC7C7GGCCAGCGGCACCACCA7GGACGCCG7GC7GGAGA7GC7GAGGGAGG7GACGCCG 246! 

ALASGTTMDAVLEMLREVT? 

2 4 61 GAGCTG7CGTGCCCCGTGGTGCTCCTCTCCTACTACAAGCCCATCATGTCTCGCAGCT.TG 252? 
ELSCPVVLLSYYKPIMSRSL 

2521 GCCGAGATGAAAGAGGCGGGGGTCCACGG7AACTATAGCTAGCTCTTCCGATCCCCCTTC 25 8C 
ASMKEAGVH 

25 81 AA77AA77AA777A7AG7AG7CCA77CA7G7GA7GA77T77G77777C77777AC7GACA 264 0 
2 641 GG7C77A7AG7GCC7GA7C7CCCG7ACG7GGCCGCGCAC7CGC7G7GGAG7GAAGCCAAG 2700 

G.LIVPDLP YVAAH SLWSEAK 

27 01 AACAACAACC7GGAGC7GG7AGG77GAA77AAG77GA7GCA7G7GA7GA777A7G7AGC7 27 60 
N N N L E L 

27 61 AGA7CGAGC7AGC7A7AA77AGGAGCA7A7CAGG7GC7GC7GACAACACCAGCCA7ACCA 2820 

VLL T7PAI? 

2821 GAAGACAGGA7GAAGGAGA7CACCAAGGC77CAGAAGGC77CG7C7ACC7GG7AG77A7A 28 80 
E DRMKS 17 K A S S G FVYL 

28 81 7G7A7A7A7AGA7GGACGACG7AAC7CA77CCAGCCCCA7GCA7A7A7GGAGGC77CAA7 2940 
2941 7C7GCAGAGACGACGAAGACCACGACGACGAC7AACAC7AGC7AGGGGCG7ACG77GCAG 3000 

3001 G7GAGCG7GAACGGAG7GACAGG7CC7CGCGCAAACG7GAACCCACGAG7GGAG7CAC7C 30 60 
VSVNGVTGPRANVNPRVSSL 



68/94 



WO 93/07278 



PCT/US92/08476 



FIG. 24 



3 061 A7CCAGGAGG77AAGAAGG7GAC7AACAAGCCCG77GC7G7TGGC77CGGCA7A7CCAAG 3120 
IQSVKKVTNKPVAVGFGISK 

3121 C C C GAGCAC G T GAAGCAGG7 AC G T AC G T AGC T G AC C AAAAAAAAC 7 G77 AAC AAG 777 7G 3180 
? E H V K 

3131 7TTGACAAGCCGGCTACTAGC7AGC7AACAG7GATCAG7GACACACACACACACACAGAT 3240 

Q ^ 

3241 7GCGCAG7GGGGCGC7GACGGGG7GA7CA7CGGCAGCGCCA7GG7GAGGCAGC7GGGCGA . 3300 
AQWGADGVI IGSAMVRQLG3 

3301 AGCGGCTTCTCCCAAGCAAGGCCTGAGGAGGCTGGAGGAGTATGCCAGGGGCATGAAGAA 33 60 
A A S P KQGLRRLE .E YARGMXN 
+++ 

33 61 CGCGC7GCCATGAGTCCATGACAAAG7AAAACGTACAGAGACACTTGATAATATCTATCT 3420 

ALP 

3421 A7CA7C7CGGAGAAGACGACCGACCAA7AAAAA7AAGCCAAG7GGAAG7GAAGC77AGC7 34 80 

34 81 G7A7A7ACACCG7ACG7CG7CG7CGTCGT7CCGGATCGA7C7CGGCCGGC7AGC7AGCAG 3540 
3 5 4 1 AACG7G7ACG7AG7AG7A7G7AA7GCA7GGAG7G7GGAGC7AC7AGC7AGC7GGCCG77C 3 6 0 Z 
3 601 A77CGAT7A7AA7TC77CGC7C7GC7G7GG7AGCAGA7G7ACC7AG7CGA7CTTG7ACGA k * 36 6 C 

3661 CGAAGAAGC7GGC7AGC7AGCCG7C7CGA7CG7A7A7G7AC7GA77AATC7GCAGAT7GA 3720 

$ 

3721 ATAAA AAC T ACAG7AC GCA7 A7GA7 GC G7 AC G 7 AC G 7 GTG 7 A7 AG7 7 7G7GC7CA7ATAT 3780 

37 81 GC7CC7CA7CACC7GCC7GA7C7GCCCA7CGA7C7C7C7CG7AC7CC77CC7G77AAA7G 3840 

3 841 CC77C777GACAGACACACCACCACCAGCAGCAG7GACGCTC7GCACGCCGCCGC7T7AA 3900 

3 901 GACA7G7AAGATAT777AAGAGG7A7AAGA7ACCAAGGAGCACAAA7CTGGAGCACTGGG 3962 

3 9 61 A7 A7 7GC AAAGACAAAAAAAAAACAAAA7 7 AAAG 7 C C C AC CAAAG 7 AGAGA7 AG7 AAAGA . 4020 

4 021 GG7GGA7GGA77AAAA7TA7C7CA7GA77777GGA7C7GC7CAAA7AGA7CGA7A7GG7A 4 080 
4 0 81 T7CAGA7C7A7G77G7ATAGCC7777CA77AGC7T7C7GAAAAAAAAA7GG7A7GA7GAG 4140 
4141 7GCGGAG7AGC7AGGGC7G7GAAGGAG7CGGATGGGC7TCCACG7AC77G777G7GGCCC 4200 
4 201 7AG7CCGG77C7A777AGG7CCGA7CCGAG7CCGGCA7GG7CCGG77CCA7ACGGGC7AG 42 60 
4261 GACCAAGCTCGGCACG7GAG7777AGGCCCG7CGGC7AGCCCGAGCACGACCCG77777A 4 3 20 
4 321 AAC7GGC7AGGAC7CGCCCA777AA7AAGACAAACA77GCAAAAAA7AGC7C7A777777 4380 
4 381 A777AAAA7A7A77G777A777G7GAAA7G7G7A7TA777G7AA7A7A7A77A77G7A7A 44 40 
4 441 7AG77A7A7C7TCAA7TA7GA777A7AAA7A7G777777A77A7GAAC7CAA7777AAG7 4500 
4 501 T7GA777A7GCG77GGCGGGC7CGAGGAGGCACGG7GAACA77777GGG7CGGGC77AAC 45 60 
4561 GGG7CGGCCCGGCCCGG77CGGCCCA7CCACGGCCCA7CCCG7G7CGGCC7CG77CGG7G 4 620 
4 621 AG77CAGCCCG7CGGACAACCCG7CCCCGGCCCGGA7AA77AA7CGGGCC7AACCG7GGC 4 680 
4 681 G7GC77AAACGG7CCG7GCC7CAACGGACCGGGCCGCGGGCGGCCCG777GACA7CTC7A 4740 
47 41 G7GG7G7GAT7AGAGA7GGCGA7GGGAACCGA7CAC7GA77CCG7G7GGAGAA77CGA7A 4800 
4 801 7CAAGC77A7CGA7ACC 4 817 

Figure 24. 

Entire sequence of the maize 7rpA gene, with introns 
and exons, transcription and translation strats, start and 
, atop of cONA. 

$ = start: and end of cDNA; + 1 = transcription start; 73******* = 
primer extension primer; a = start of translation; +++ = stop 

codon; *= CCAA7 Box, 7A7AA 3ox, poly A addition site. 

I above underlined sequences are ?CR primers. 
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2110 5N98* 

SOQ 70d 

P C BR ES LP MP UP C 30 ES LP MP UP 



21 1D 5N984 

I i i i 

39d 46d 

. ,i i 

S L R P SH LP UP LP MP UP 




Northern blot: showing differencial expression of 
TrpA gene in maize tissues. 2 hour exposure against 
film at -3GC with Zupcnt Cronex intensifying screens. 
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FIG. 25B 



2110 5N984 

60d 70d 

P C 3R ES LP MP UP C BR £S LP MP UP 




211D 5N984 

i i . 

39d 46d 

i ii 1 

S L R P SH LP UP LP MP UP 




Northern blot shewing differential expression of 
TrpA gene in maize tissues. A hour exposure against 
film at -80C with Dupont Cronex intensifying screens. 
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2110 



1G. 25C 



SN984 



60d 



rod 



P C 3R ES LP MP UP C SR ES LP MP UP 



W «3 



211D 



5N984 



39d 



46d 



S L R P SH LP UP LP MP UP 




Northern blot showing differential expression of 
TrpA gene in maize tissues. 13 hour exposure against 
film at -80C with Cupont Cronex intensifyina screens 
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FIG. 25D 



2VJ0 5N984 

SOd 700 

p C BR SS LP MP UP C 3R ES LP MP UP 




21 1D 5N984 

39d 46d 

I h ! 

S L R P SH LP UP LP MP UP 
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211D 5N984 21 1 D 

L P L P S1 S2 S3 S4 ' S5 




Northern bier showing mai:e TrpA gene expression in 
Funk lines HID and 5N984 leaf and pith and the absense 
of expression in 211D seed " total RNA. 

65 hour exposure against fil.-n at-3CC with Dupont Cronex 
intensifying screens. 
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PCT/US92/08476 



9.4- 
4.3- 



10X 5X1X S H EV E B 

o 



2.0- 



1.2 — O • 



Genomic southern cf Funk line 211D probed with -h- 
TrpA cDNA 8-2. B • 3amHI, E = ScoRI, EV = -coRV," " 
H = Hindi 1 1 and S = Sad. 

120 hour exposure against film at-80C with Dupor.t Cr 
intensifying" screens. 
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Tn G A T C 1 2 




Primer extension showing the transcription start of TrpA 
gene and sequencing ladder. 

1 hour exposure against film at-80C with Dupont Cronex 
intensifying screens. 
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•IG. 28B 



42°C 48°G 54°C 
1 2 3 1 2 3 1 2 3 P 




RNase protection cf region from -2 bp to ^22"" bp 
with three annealing temperures . 

16 hour exposure against film at-50C with Oupont Crone 
intensifying screens . 
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FIG. 30 



Maize Pollen CDPK cDNA sequence 

sequence contained in clones pCIB3168 and pCIB3169 



1 TG CAG ATC ATG CAC CAC CTC TCC GGC CAG CCC AAC GTG GTG GGC CTC CGC GGC GCG 
l>Gln Me Mat His His Leu Ser Gl y Gin Pro Asn Va I Val Gl y Lau A rg Gl y Ala 

57 TAC GAG GAC AAG CAG AGC GTG CAC CTC GTC ATG GAG CTG TGC GCG GGC GGG GAG CTC 
19>Tyr Glu Asp Lys Gin Ser Val His Lau Vai Mat Gl u Lau Cys Ala Gy Gy Gl u Leu 



Aval 

114 TTC GAC CGC ATC ATC GCC CGG GGC CAG TAC ACG GAG CGC GGC GCC GCG GkG CTG CTG 
23>Phe Asp Arg lie I le Ala Arg Gl y Gin Tyr Thr Gl u A rg Gly Ala Ala Glu Leu Leu 



l"i CGC 


GCC 


ATC 


GTG 


CAG 


ATC 


GTG 


CAC 


ACC 


TGC 


CAC 


TCC 


ATG 


-»^*«. 


GTG 


ATG 


CAC 


CGG GAC 


57>Arg 


Al a 


1 1 a 


Vai 


Gin 


i i e 


Vai 


Hi s 


Thr 


Cys 


Hi s 


Ser 


Met 


Gly 


Val 


.Met 


His 


Arg Asp 






Aval 
































22S ATC 


AAG 


CCC 


GAG 


AAC 


TTC 


CTG 


CTG 


CTC 




AAG 


GAC 


GAG 


GAC 




CCG 


CTC 




~r> 1 1 e 


Lys 


Pro 


Gl u 


Asn 


Phe 


Leu 


Leu 


Leu 


Ser 


wys 


Asp 


Gl u 


Asp 


Al a 


Pro 


Leu 


Ly s Ala 


235 ACC 


GAC 


TTC 


GGC 


CTC 


TCC 


GTC 


TTC 


TTC 


AAG 


GAG 


GGC 


GAG 


CTG 


CTC 


AGG 


GAC 


ATC GTC 


95 ► Thr 


Asp 


Phe 


Giy 


Leu 


Ser 


Val 


Phe 


Phe 


Lys 


Gl u 


Gly 


Gl u 


Leu 


Lau A rg 


Asp 


i 1 e VaJ 



Aval 

342 GGC AGC GCC TAC TAC ATC GCG CCC GAG GTG CTC AAG AGG AAG TAC GGC CCG GAG GCC 

114 ►Gly Ser Ala Tyr Tyr Me Ala Pro Glu Val Leu Lys Arg Lys Tyr Gly Pro Glu Ala 

:?9 GAC ATC TGG AGC GTC GGC GTC ATG CTC TAC ATC TTC CTC GCC GGC 'GTG CCT CCC TTC 

123>Asp lie Trp Ser Val Gly Val Met Leu Tyr lie Phe Leu Ala Gly Vai Pro Pro Phe 

456 TGG GCA GAG AAC GAG AAC GGC ATC TTC ACC GCC ATC CTG CGA GGG CAG CTT GAC CTC 

132> Trp Ala Glu Asn Glu Asn Gly ile Phe Thr Ala Me Leu Arg Gly Gin Lau Aso Leu 

513 TCC AGC GAG CCA TGG CCA CAC ATC TCC CCG GGA GCC AAG GAT CTC CTC AAG AAG ATG 

171 ►Ser Ser Glu Pro Trp Pro His Ile Ser Pro Gly Ala Lys Asp Leu Val Lys Lys JVfet 

570 CTC AAC ATC AAC CCC AAG GAG CGG CTC ACG GCG TTC CAG GTC CTC AAT C\C CCA TGG 

190>Leu Asn ile Asn Pro Lys Glu Arg Leu Thr Ala Phe Gin Val Lau Asn His Pro Trp 

627 ATC AAA GAA GAC GGA GAC GCG CCT GAC ACG CCG CTT GAC AAC GTT GTT CTC GAC AGG 

209Mle Lys Glu Asp Gly Asp Ala Pro Asp Thr Pro Leu Asp Asn Val Vai Leu Aso Arg 

634 CTC AAG CAG TTC AGG GCC ATG AAC CAG TTC AAG AAA GCA GCA TTG AGG ATC ATA GCT 

22S^Lau Lys Gin Phe Arg Ala Met Asn Gin Phe Lys Lys Ala Ala Lau Arg Ile Me Ala 

741 GGG TGC CTA TCC GAA GAG GAG ATC ACA GGG CTG AAG GAG ATG TTC AAG AAC ATT GAC 

2M> Giy Cys Leu Ser. Glu Glu Glu Me Thr Gly Leu Lys Glu Met Phe Lys Asn Me Asp 

798 AAG GAT AAC AGC GGG ACC ATT ACC CTC GAC GAG CTC AAA CAC GGG TTG GCA AAG CAC 

266>Lys Asp Asn Ser Gly Thr II e Thr Leu Asp Glu Leu Lys His Gly Leu Ala Lys His 

355 GGG CCC AAG CTG TCA GAC AGC GAA ATG GAG AAA CTA ATG GAA GCA GCT GAC GCT GAC 

285^ Gly Pro Lys Leu Ser Asp Ser Glu Met Gi u Lys Lau Met Glu Ala Ala Asp Ala Asp 

EcoRI 

912 GGC AAC GGG TTA ATT GAC TAC GAC GAA TTC GTC ACC GCA ACA GTG CAT ATG AAC AAA 

304>Gly Asn Gly Lau lie Asp Tyr Asp Glu Phe Val Thr Ala Thr Vai His Met Asn Lys 
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969 CT3 


GAT 


AGA 


GAA 


G*G 


CAC 


CTT 


TAC 


ACA 


GCA 


TTC 


CAG 


TAT 


1*1 \— 


GAC AAG GAC AAC 


AGC 


323> Leu Asp A rg 


Gl u 


Gl u 


Hi s 


Leu 


Tyr 


Thr 


Al a 


Ph 


Gl n 


Tyr 


Pne 


Asp Lys Asp Asn 


Ser 


1026 GGG 


TAC 


ATT 


ACT 


AAA 


GAA 


GAG 


CTT 


GAG 


CAC 


GCC 


TTG 


AAG 


GAG 


CAA GGG TTG TAT 


GAC 


342 ► Gl y 


Tyr 


1 1 a 


Thr 


Lys 


Gl u 


Gl u 


Leu 


Gl u 


Hi s 


Al a 


Leu 


Lys 


Gl u 


Gl n Gl y Leu Tyr 


Asp 


1083 GCC 


GAT 


AAA 


ATC' 


AAA 


GAC 


ATC 


ATC 


TCC 


GAT 


GCC 


GAC 


TCT 


GAC. 


AAT GAT GGA AGG 


ATA 


361>Ala 


Asp 


Lys 


1 1 e 


Lys 


Asp 


t le 


1 i a 


Ser 


Asp 


Al a 


Asp 


Ser 


Asp 


Asn Asp Gly Arg 


Me 


1140 GAT 


TAT 


TCA 


GAG 


TTT 


GTG 


GCG 


ATG 


ATG 


AGG 


AAA 


GGG 


ACG 


GCT 


GGT GCC GAG CCA 


ATG 


38C>Asp 


Tyr 


Ser 


Glu 


Phe 


Val 


A! a 


Mat 


tot 


A rg 


Lys 


Gly 


Thr 


Al a 


Gly Ala Glu Pro 


Met 


1197 AAC 


ATC 


AAG 


AAG 


AGG 


CGA 


GAC 


ATA 


GTC 


CTA 


TAG 


TGAAGTGAAGCAGCAAGTCTGTAATGTAATGTG 


399*Asn 


i 1 a 


Lys 


Lys 


A rg 


A rg 


Asp 


Me 


Val 


Leu 


• • 













1252 7ATA:)CV3CTC*AACAAGC^ 
1340 AAAAAAAAAA 
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inner leaf sheath 

inner leaf whorl 

green leaf 

anther 

pollen 

silk 

kernel 

pith 

root 
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Lipman-P arson Protein Alignment 
Gap Penalty: 2. Gap Length Penalty: 12 
Seql Seq2 
potCDPK otn rat c*2 ptn 



Similarity 
Index 



Gap 
Number 



Gap Consensus 
Length Length 



1>551 1 >528 



36.5 



4 



4 297 



oo I CDPK ptn YSMGKELGRGQFGVTHLCTHRTSGEKLACKT I AKRKLAAREDVDDVRREVO I MHHLSGQPNVVGLRGAYE 162 

y : . EIG; G. F: V. : C. . : TS. : A K. ! : XL: AR: : . :• RE. : I • 1 ■ PN- V l* ■ • 
-a: p*2 ptn YQLFEELG<GAFSVVRRCVtCiCTSTCEYAAiCMKT<)CLSAROH-GICLEREARICRLL<-HPN|VRLH0SIS 81 



oo I C2PK ptn :kQSVHLVMELCAGGELFOR! I ARCCYTERGAAELLRA I VQ I VHTCHSMCVMHRO ! KPENFLLLSK0E3A 232 

. LY. : L : GGELf : 1: AR. Y: E : A: : V: H : : HRD: (CPEN: LL SK A 

-at p*c2 ptn EEGF HYLVFOLVTGGELFEO I VAREYYSEAOASHC I HQ I LESVNH I HQHO I VHROLKPENLLLASKCKGA 15* 



poi COP< ptn ?L<ATDFGLSVFFKEGELLR-OIVGSAYYIAPEVL<RK-YGPEAO!WSVGVMLY!FLAGYPPFVAENE^G 300 

::<:DFGL:: :..: : : : . G: : Y: : PEYL: : . YG . 0!W: GV: LY I; L. G PPFV. E* ■ * 
-at pk2 Ptn AVKLADFGLA I EYOGEQQAWFGF AC7PGYLSPEVLRKDPYGKPVD I WACGV ILY I LI VGYPPFVOEDQHK 22' 



Dot CDPK ptn ! FTA I LRGGLDISSEPWPH I SPGAjCOL VKKMIN ! NPKERLTAFQVLNHPVI KEDGDAPOTPLDNVVLDRL 370 

: : I G. 0: : S W : : P. AK: L: : : "!l IHP R: TA Q L: HPW: • * - l 

-or d*2 Dtn - r QO!<AGAY0FPSPEW0TYTPEA<NLINOML T ;VPAKRITA0CALKHPWVCCRSTVASMMHRGETVECL 29" 



do Z2?< otn <CFRAHNGF OCAALR 1 I 3B7 

: : F. A. . : : K A L . : 



- Z 1 Zsk2 3 t - 



=E<"VAR^<.<GA: w "* w 308 
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Lipman-Peatson Protein Alignment 

Gap Penalty: 2. Gap Length Penalty: 12 

Seql Seq2 Slmiartty 

po< CDPK ptn humcama ptn Index 

1 >551 1 >150 40.3 



Gap 
Number 



Gap Consensus 
Length Length 



142 



pol CDP< p 



numcoma ptn 



: r LSEEE I TGLKEHFKN I DKONSCT I TLDELKHCLAKHCPKLSOSEMEKLMEAAOAOGNGL I OYOEFVT ATV U6C 
L:EE:I: : <E F. : OKD. . GT ! T EL . G. : ::: E: : OAOGNG I D- EF- T 

LTEEQ I AEFKEAFSLr OKOGDGT I TTKELCTVHRSL GGNPTEAEL QDM I NEYOAOGNGT I OFPE^LTMMA 7a 



do I CSPK 
numcomo p 



o'^ H-MNKLCREEHirTAfQYFDKDNSGYtTKEELEHALKECG-lYOAOKUOl I SDAOSONDGR I DYSEFVA ??fl 
: M: . 0. EE:: AF: FOKD. :GYI: . EL H. : . : C ..:.:.::!.: AD 0 DC" •• Y FFV 
tn RKMKDTDSEEE I REAFRVFOKDGNGY I SAAELRHVNTNLGEKLTDEEVDEMI READ I DGDGQVNYEEFVQ ia« 



pol CDPK MM 530 

MM 

*umcamo ptr MM 146 
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Lipman-Pearson Protein Alignment 
Gap Penalty: 2. Gap Length Penalty: 12 

Seq1 Seq2 Smiarity Gao Gap Consensus 

pot CDPK otn soybean CDPK ptn Index Number Length Length 

1>551 1>509 SZA 1 1 464 

ooi CDPK otn VLCRPMEDVRATYSBCKELGRCOFGVTHLCTHRTSCEKLACKTUKRKLAAREDVOOVRREVCmHHLSB '50 

M:::: :!::l::l: i I : I I I I : I: . ! I : I : I I : | I I I : I : I | | I : • • I I ■ I I • ! I • ■ ; i | | | 7 
soybeon ZZ?K ptn VLPQRTQN I REVYEVGRKIGOGQFGTTFECTRRASGGKFACKS I PKRKLLCKEDYEDVWRE i C i MHHLSE 5" 

oo. CDPK otn OPNVVCLRGAYEOKOSVHLVHEICAGGEIFDR! 1 ARGOYTERCAAELLRA I VQ [ VHTCHSNCVWHRO | KP 220 

"I: : I: III:.: I I I I I II I: llllllll :: | : i : || : || : I : : I I. | • || nm , 
soybean .C3PK ptn HANVVR I EGTYEDSTAVHL VMELCEGGELFDR I VQKGHYSERQAARL IKT I VEVVEACHSLGVMHROLKP '61 

ool CDPK otn £NFLLLSKDEOAPL<ATDFGLSVFFKEGELLRO I VCSAYY I APEVLICRICYGPEA0IWSVGVML V 1FLAGV 290 

MM .:: Mil: I Ml INI III: I. II : I III: II I I 1 1 I : : . I I I I : I I I: I I: I 1 1- |- I I 
soybeon C:°< otn cNFLFOT I DEDAKL<ATDFGLSVFYKPCESFCOVVGSPYYVAPEVLRKLYGPESDVWSAGV ! LYILLSGV 23 1 

ooi COP< otn PPFVAENENG I F7A I LRGOLDLSSEPVPH I SPGAKOL VKJCMLNI NPKERLTAFQVLNHPWI KEDGDAPOT 350 

I I I I I I: I: I I I : 1 I I: I I : I I I I I: I I: : I | I I: : I I I: I I I: I I I i: II I I I I- •• Ml- 
soybeon :;?< ptn PPFWAESEPC ! FRO I LLGKLDFHSEPVPS I SOSAKOL I RKMLDQNPKTRLTAHEVLRHPW 1 V02N | APOK 2C1 

oo otn "LDNVVlORlKQFRAMNQrKKAALR! I AGCLSEEE ! TGLKEMFKN I OKONSGT ! TLDELKHG : A<HGPKL t30 

' M:: II Mill III:: II: Ml II:: I I I I II : MM II I • n- ■ ,f j 

soybeo- :: = < ot- P , -DSAVLSRL<C r SA - S , < , .<<"4L?V!A£5 , .Sti"!CC , _<£ , _= , <M!3 T ^NSG" , :"-2EL<b:.V;VGSE "~ - 

oo. C3PK otn SOSEHEKL.1EAA0A0GNGL I DYQEFVTATVHNNKLOREEHL YTAFQYFDKONSGY I TKEEL r KALKEQGL 500 

: : M: : : I: : I. I II: I I: : II I I I M: I I I I : | |. || I I I: II M I. ! • I I - IT 

soyoeon ■__<»< ptn MESE 1 KDLMOAAO I OKSGT i OYGEF! AATVHLNKltREtNLVSAFSYFOKOGSGY ! 7L0E I QCACKDFGL *41 

oo. C2?< otn YCAOK KDM SDACSDNOGR I DYSEFVAMMRKGTAGAEPMN I K< 541 

I: . I: ': I: : : !: M I I: II I: I I: I I I ! M: : I: | 

sc,oeo- ::-< c-.- : <r : :::s;ac: ; v :e;-aa--^<cscg; 3^s _ -^< uau 
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FIG . 35 

poi CDPK gene Map (1 > 4165) Site and Sequence 

Enzymes: 6 of 198 enzymes (Filtered) 

Settings : Circular. Certain Sites Only. Standard Genetic Code 

"TAGTAACACCTCTCCAATCGCT TGGGTTGGCACATTCTTAGCTTTTATCACATTTTAAGAAATAGAGTTCACCACCTTC 

i * . — l- , 1 1 ; L ao 

A AAA7A7GCC7A7ACAA7GAA7GA7GC77GGA7GCAA7A TAGC7AGA77CAAC7AGC7ATA7A7GG7CAA7AGAACCC7G 

' 1 ' — 1 — ■ i : ~ 160 

TGAGCACCTCACAAACACGACTTCAATTTTGA GACCCTAAGCGAGTAAATGGTTAAAGTCCTCTTATTATTAGTCTTAGG 

' ' ' • 1 _l 24fj 

ACTTCTCCTTGCTAAATGCTTGTCAGCSATCT ATATATCTTCCCCACTGCGGGAGATACTATATATAGGGCCTTGGACCT 

1 1 ! — ■ 1 ■ i u 320 

CTAGGGTATCTCAAAGGCCTAGTCACAAC AATTCTCAACAGTATTTAATTTTATACATGTATGAACAGTGTAGGAATTTG 

1 = ' i i . . 1 ^ 40Q 

AGTGCCCAACCCAAGAGTGGGAGG TGTAAATTGGGTAGCTAAACTTAAATAGGGCTCTTCTTATTTAGGTTTATCTAGTC 

— 1 ~ ' — ■ « ■ : l. 4ao 

TCTACTTAGACTAAT7CAGAAAG AATTTTACAACCTATGGTTAATCATATCTCTAGTCTAAGCAAATTTAGGAAAGTTAA 

— ■ ■ ■ . 1 1 i u S60 

AAGCACACAATTAGGCACA7GTGAAAGATGTGT ATGGTAAGTAAAAGACTTATAAGGAAAAAGTGGGTGAATCCTCAAGA 

' ' 1 — -u- . . . ; u 6 4o 

TGTGGTGGTATATCCCAATGATAT TAGATGCCAGAATATAGGGGGGAAATCGATGTATACCATCTCTACCAGGATACCTG 

— < ; = ' 720 

TGCGGACTGTGCAAC7GACACATGGACCA 7GGTGTCTTC7TAGATTTGGTTATTAGCTAATTGCGCTACAACTTGTTCAA 

^ ' 1 ' 1 : 1 1 1 ! 1 ^ 800 

2GCTAGACCAAATTAAAAAACTAATATTAAAC ATAAAAAGTTAGGCAAACTATAGTAAATTATGCAGCGATCCAACAACA 

— ! ' « i 1 i i L 

AGCC ATGTCTCGTGGGTCATGAGCCAC3CG TCGGCCATACACCCACATGATGTTTCCATACGGATGGTCCTTATGCAATT 

— 1 ' 1 : ^ 960 

TTGTCTGCAAAACACAAGCCTTAATAC AGCCACGCGACAATCATGGAAGTGGTCGTTTTAGGTCCTCATCATGAAGTTCA 

' ' — ' ■ ■ 1 1 l 1CW0 

GGGAAAACGCATCAAATGTAATGCAG AGAAATGGTATTTCTTCTCTTGTAAATCAGGGAGAGGAGTACCATCAGTACAGA 

— ' ■ ■ 1 i ^ L H20 

: Ecofl ! 

TTCAGAATCAGAATTCAGTCTT CCAACGACAATAATC3CAGCATCTTGTAAAAATTTGCAGAAACTTCTGTTTGACTTGT 

— — 1 ■ ■ > 1 - u 1200 

AGCCCTGACCT7TGCAAAT ATTTGAAGTTGTGCCTGCTGACACAACTTCAATCTGGAAGTGCTGT7GATCAGTTTTGCCA 

- : '> — i ; 1280 

^gaaacagcaagcagcct atatatatctgtcacgagacacc:tgc:gccctcttctttcccgccattccctc::7accc77 

1 1 ! ; i ■ - 1360 
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FIG. 25 (CONT) 

pol CDPK oene Map M > 4165) Site and Sequence 

Xbai 

CAAAATCTAGAAACCTTTT7TTTTCCTCCCGATACGCCCCTCCATCTCTCGCC3TTCATGTCC3TGGCTGGCTGCCCTCC 
— 1 i i ■ 1 i , , L i44Q 



l mflNA start- 1 



* GTGGGAGCAGGCGGCCGCACTCGTTCCCCGCCGCAGCCATGGGCCAGTGCTGCTCCAAGGGCGCCGGAGAGGCCCCGCCA 

1 i i 1 1 1 , ; u !520 



■EXON V 



CCGAGGCGCCAAACGGCAGGCGCCAAGCCGCGGGCGTCCGCGAACAACGCCGACGGACAACGGGCGTCGTCCTCGTCCGC 
i 1 1 1 . 1 ■ 1 l 1 600 



•EXONV 



GGTGGCTGCTGCCGCTGCTGCTGCCGGTGGTGGTGGCGGCGGCACGACGAAGCCGGCCTCACCCACCGGCGGCGCCAGGG 
■, , s , . , , . u 1680 



•EXON1- 



CCAGCTCCGGCAGCAAACCGGCGGCGGCCGTGGGCACGGTGCTGGGCCGGCCCATGGAGGACGTGCGCGCGACCTACTCG 
— 1 1 1 j i i , 1 , u 17SO 



•EXONV 



Ava I 



AT GGGCAAGGAGCTCGGGCGCGGGCAGTTCGGCGTG AC GCACCTGTGC AC GC AC CGGACGAGCGGCG AG AAGCTGGCGTG 

- — — — 1 . 1 ■ i ' ' ' ■ >• 18 z| 0 



■EXON V 



CAAGACGATCGCGAAGCGGAAGCTGGCGGCCAGGGAGGACGTGGACGACGTGCGGCGGGAGGTGCAGATCATGCACCACC 
i l- . i i 1 1 . * J920 



•EXONV 



TCTCCGGCCAGCCCAACGTGGTGGGCCTCCGCGGCGCGTACGAGGACAAGCAGAGCGTGCACCTCGTCATGGAGCTGTGC 

- ■ 1 ! ; ■ 1 1 1 1 ^ 2000 



- EXON 1 - 



Ava I 

GCGGGCGGGGAGCTCTTCGACCGCATCATCGCCCGGGGCCAGTACACGGAGCGCGGCGCCGCGGAGCTGCTGCGCGCCAT 



2080 



• EXON 1 
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pol CDPK gene Map /1 > 4165) 



FIG. 35 (CONT) 
Sit and Sequence 



Ava i 



:gtscagatcgtgcacacctgccactccatggsggtsatgcaccgggacatcaagc'ccgagaac7tcctgctgctcagca 



2160 



■EXON V 



AGGACSAGGACGCGCCSCTCAAGGCCAC:GACTTCGGCCTC7CCGTC77CTTCAAGGAGGGCGAGCTGCTCAGGGACATC 



2240 



• EXON 1 ■ 



Ava I 



G7CGGCAGCGCCTACTACAT CGCSCCCSAGGTGC7CAAGAGGAAG7ACGGCCCGGAGSCCGACATCTGGAGCGTCGGCGT 

1 1 1 1 ' ■ 1 



2320 



■exonv 



Bam H 



CATGCTCTACATCTTCCTCSCCSSCGTeCCTCCCTTCTGSGCAGGTCSGATCCGTCCGTeTTCGTCgTACACGATATACA 



•EXON1- 



2400 



• iNTRON 1 • 



GAACCCGACGATGGAT77GC77CTCAGCCCTGTTC77GCATCACCAGAGAACGAGAACGGCA7C77CACCGCCATCCTGC 



2480 



• INTRON 1 ■ 



■ EXON 



5AGGGCAGC77GACC7C7CCAGCGAGCCA7GGCCACACA7C7CGCCGSGAGCCAAGGA7C7CG7CAAGAAGA7GC7CAAC 



2560 



■EXON 2- 



A7CAACCCCAAGGAGCGGC7CACGGCGT7CCAGG7CC7CAG7AAG7ACCCAGA7CGT7GC7G7CA7ACAC7CATA7GAAT 



•EXON 2- 



2640 



■INTRON 2- 



7G7A7CG77CA7GAGCAACGA7CGAGCGGA777GG7GAAC77G7AGA7CACCCA7GGA7CAAAGAAGACGGAGACGCGCC 



2720 



•INTRON 2- 



■EXON 3- 



TSACACSCCGCTTGACAACGTTGTTCTCGACAGGCTCAAGCAGTTCAGGGCCATGAACCACTTCAACAAAcrAerATTr 



2800 



•EXON 3- 
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FIG. 35 (CONT) 

pol COPK pen Map (1 >4165) She and Sequence 

GGGTACATTATCTGATAAAAGC7CCACAAATACAACTTCTGAAGAACAGCAATGCTTACAC5GCAGAATTTTCATTATAA 
* 1 1 . 1 i . l 2880 



■ INTRON 3- 



A7GC7C77GA7GACA7AA7G77AGA7CA7AGC7GGG7GCC7A7CCGAAGAGGAGA7CACAGGGC7GAAGGAGA7G77CAA 

' ' 1. 2960 



INTRON 3 ' EXON4- 



GAACATTGACAAGGATAACAGCGGGACCATTACCCTCGACGAGCTCAAACACGGGTTGGCAAAGCACGGGCCCAAGC7GT 

' ' ' : ' ! ! 3040 



> EXON 4- 



CAGACAGCGAAATGGAGAAAC7AATGGAAGCAGTGAGTT7TCAGAGTACAATCTTAAAAAAAGGAATTG7GATTCTTTTC 

— i i i i i ; 1 i u 3120 



• EXON 4 1 iNTRON 4- 



AAAATGAAGAAG7AA7C7GAAAACA7CCC7GC7GAAA7GC777A7ACA77TCCAGGC7GACGC7GACGGCAACGGG77AA 

' ' 1 1 ' 1 ^ 1 1 ' L 3200 



4 



- INTRON 4 ' EXCN 5- 



: EcoR! 

rGAC7ACGACGAA77CGTCACCGCAACAGTGCATATGAACAAAC7GGA7AGAGAAGAGCACC77TACACAGCATTCCA6 



- 3280 



EXON 5- 



; EcoR I 

rATTTCGACAAGGACAACAGCSGGTAAGTTGAACGTTAAAATGATACAGC7GGTACC7GAAT7CTGGACAACACATATCA 



3360 



• EXON 5 1 INTRON 5- 



TAACAGGACACATA7A7AA77CG777A7C7CACAGG7ACA77AC7AAAGAAGAGC77GAGCACGCC77GAAGGAGCAAGG 
' ' ! ' ' ^ 2iW0 



iNTRON 5 » EX0N6- 



T G77G7A7GAC3CCGA7AAAA7CAAAGACA7CA7C7CCGA7GCCGAC7C7GACAA7G7AAGGAACAAACA77A777AAA77 



3S20 



> EXON 6 : 1 INTRON 6- 
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FIG. 35 (CONT) 

pot CDPK ociie Mao (1 > 4165^ Site and Sequence 

TCAGCCGACAAACTAAACTATAGAAACCACATCATGATATCAAATTTTGAG GTGGCGGTGCTACAGAAATAGAACCCAGT 

~ ' ' — 1 - 3600 



■iNTRON 6- 



ACACCAAAATGACTAACTTGTCATGATTAGTTGTTCCTCGTAACTGAACATTTGTG TTCTTAGTTTC77ATTGTTAAACC 
. ' ' ' 1 — 3680 



•INTRON 6- 



AAAGAC77AAATTCAC7777GCACAT6C AGCATCGAAGGA7A6AT7ATTCAGAG777GTGGCGATGATGAGCAAACGfiAr 

' ' ' * 3760 



■ INTRON 6 ' EXOflT 



GGCTGGTGCCGAGCCAA7GAACATCAAGAAGAGGCGAGACA7AGTCCTA7AGTGAA GTGAAGCAGWAAGTGTGTAATGTA 

~ ' 1 ' ■ 3840 

=) 

EXON 7 I 

ATGTG7ATAGCAGCTCAAACAAGCAAAT7TGTACATC7GTACACAAATGCAATGGG GTTACTTT7GCAACTTAG7TCATG 

L " ' l — ' 1 ■ ' 3820 

GATGG77GTGTACGTTGTGCTAT7GATTGCAAGTGAT77GAAAGACA TGCATACTTAGGAAC7GAGAAAGATAGATC7AC 

' ' ' 1 1 ' u 4000 

TACTGCTAGAGACAGAACAATAGGATKKYAATTCAGYAAGTGYGTATTTCAG AAGACTACAGCTGGCATC7ATTATTCTC 

' ' ' ' ■ 1 1 4080 

ATTGTCC7CGCAAAAATACTGATSATGCATTTGAGAGAACAATATSCAACAA GATCSAGCTCCCTA7AGTGAG7CGTATT 

' ' ' : ' ' ' 1 4160 

AGGCC 

► 4165 
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rig. 2 7 

E:\Gr2-PA7.SEQ 9/21/92 

NAME: ?CIB55i5 7415 3PS DNA CIRCULAR UPDATED 9/21/92 

DESCRIPTION: pGFl with synthetic geiaer fix. 

*** SEQUENCE - * # 

1 ATGGACAACA ACCCCAACAT CAACGAGTGC A7CCCC7ACA ACTGCCTGAG CAACCCCGAG 
MetAapAan AsnProAsn IleAanGluCys XiePrcTyr AsnCyaLeu SerAanProGlu 

61 G7GGAGGTGC TGGGCGGCGA GCGCATCGAG ACCGGC7ACA CCCCCATCGA CA7CAGCC7G 
ValGiuVal LeuGlyGIy GluArglleGiu ThrGiyTyr ThrProIle AapIIeSerLeu 

121 AGCC7GACCC AG7TCCTGC7 GAGCGAG77C GTGCCCGGCG CCGGCTTCGT GCTGGGCCTG 
SerLeuThr GlnPheLeu LeuSerGluPhe VaiProGly AlaGlyPhe ValLeuGlyLeu 

181 GTGGACATCA TCTGGGGCAT C7TCGGCCCC AGCCAG7GGG ACGCC7TCCT GGTGCAGATC 
ValAspIle IleTrpGly IlePheGlyPro SerGlnTrp AapAlaPhe LeuValGlnlle 

241 GAGCAGC7GA 7CAACCAGCG CATCGAGGAG TTCGCCCGCA ACCAGGCCAT CAGCCGCCTG 
GluGlnLeu XleAanGln ArglleGluGiu PheAlaArg AanGlnAla IleSerArgLeu 

301 GAGGGCCTGA GCAACCTGTA CCAAATC7AC GCCGAGAGC7 TCCGCGAGTG GGAGGCCGAC 
GluGiyLeu SerAsnLeu TyrGlnlleTyr AlaGluSer PheArgGiu TrpGiuAlaAap 

361 CCCACCAACC CCGCCC7GCG CGAGGAGATG CGCATCCAG7 TCAACGACAT GAACAGCGCC 
ProThrAan ProAlaLeu ArgGluGluMet ArglleGln PheAanAsp MetAanSerAla 

421 C7GACCACCG CCATCCCCCT G7TCGCCG7G CAGAAC7ACC AGGTGCCCCT GC7GAGCG7G 
LeuThrThr AlallePro LeuPheAlaVal GlnAanTyr GlnVaiPro LeuLeuSerVal 

4 81 TACGTGCAGG CCGCCAACCT GCACC7GAGC G7GCTGCGCG ACG7CAGCG7 GTTCGGCCAG 
TyrValGln AJLaAlaAsn LeuHisLeuSer ValLeuArg AapValSer ValPheGlyGln 

541 CGCTGGGGC7 7CGACGCCGC CACCATCAAC AGCCGC7ACA ACGACCTGAC CCGCC7GATC 
ArgTrpGly PheAspAla AlaThrlleAan SerArgTyr AanAapLeu ThrArgLeuIle 

601 GGCAAC7ACA CCGACCACGC CG7GCGC7GG 7ACAACACCG GCC7GGAGCG CG7G7GGGG7 
GlyAanTyr ThrAapHia AlaValArgTrp TyrAanThr GlyLeuGlu ArgVal7rpGly 

.661 CCCGACAGCC GCGAC7GGA7 CAGG7ACAAC CAG77CCGCC GCGAGC7GAC CC7GACCGTG 
ProAspSer ArgAapTrp IleArgTyrAan GlnPheArg ArgGluLeu ThrLeuThrVal 

721 C7GGACA7CG 7GAGCC7GTT CCCCAAC7AC GACAGCCGCA CC7ACCCCAT CCGCACCGTG 
LeuAapIle ValSerLeu PheProAsnTyr AspSerArg ThrTyrPro IleArgThrVal 

781 AGCCAGC7GA CCCGCGAGA7 7TACACCAAC CCCG7GC7GG AGAAC77CGA CGGCAGCTTC 
SerGlnLeu 7hrArgGlu IleTyrThrAan ProValLeu GluAanPhe AapGlySerPhe 

841 CGCGGCAGCG CCCAGGGCA7 CGAGGGCAGC ATCCGCAGCC CCCACC7GA7 GGACATCC7G 
ArgGlySer AiaGlnGly IleGluGlySer IleArgSer ProHiaLeu MetAapIleLeu 

901 AACAGCATCA CCA7C7ACAC CGACGCCCAC CGCGGCGAG7 AC7AC7GGAG CGGCCACCAG 
AanSerlle 7hrIleTyr 7hrAapAlaHia ArgGiyGlu TyrTyrTrp SerGlyHiaGIn 

961 ATCATGGCCA GCCCCGTCGG C77CAGCGGC CCCGAGT7CA CC7TCCCCC7 GTACGGCACC 
IleMetAla SerProVal GlyPheSerGly ProGluPhe ThrPhePro LeuTyrGlyThr 

1021 A7GGGCAACG C7GCACC7CA GCAGCGCA7C G7GGCACAGC 7GGGCCAGGG AG7G7ACCGC 
MetGlyAan AlaAlaPro GlnGlnArglle ValAlaGln LeuGlyGln GlyValTyrArg 

1081 ACCC7GAGCA GCACCC7GTA CCGTCGACC7 T7CAACA7CG GCATCAACAA CCAGCAGC7G 
ThrLeuSer SerThrLeu 7yrArgArgPro PheAanlle GlylleAsn AanGlnGlnLeu 

' 1141 AGCG7GC7 GG ACGGCACCGA G77CGCC7AC GGCACCAGCA GCAACC7GCC CAGCGCCGTG 
SerVaiLeu AapGlyThr GluPheJUaTyr GiyThrSer SerAsnLeu ProSerAiaVal 

1201 TACCGCAAGA GCGGCACCG7 GGACAGCC7G GACGAGA7CC CCCC7CAGAA CAACAACG7G 
TyrArgLys SerGlyThr ValAapSerLeu AapGiulie ProProGln AanAsnAanVal 
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1261 CCACC7CSAC AGGGC77CAG CCACCG7C7G AGCCACG7GA GCA7G77CCG CAG7GGCTTC 
ProProArg GlnGlyPhe SerHiaArgLeu SerHiaVal SerMetPhe ArgSerGlyPhe 

1221 AGCAACAGCA GCGTGAGCAT CATCCGTGCA CC7A7G77CA GC7GGA77CA CCGCAG7GCC 
SerAanSer SerValSer IlelleArgAla ProMetPhe SerTrpIIe HiaArgSerAla 

1381 GAG77CAACA ACATCATCCC CAGCAGCCAG A7CACCCAGA 7CCCCCTGAC CAAGAGCACC 
GluPheAsn Aanllelle ProSerSerGln IleThrGln IieProLeu ThrLyaSerTtir 

1441 AACCTGGGCA GCGGCACCAG CGTGGTGAAG GGCCCCGGCT TCACCGGCGG CGACATCCTG 
AanLeuGly SerGlyThr SerValValLya GlyProGly PheThrGly GiyAapIleLeu 

1501 CGCCGCACCA GCCCCGGCCA GATCAGCACC C7GCGCG7GA ACATCACCGC CCCCCTGAGC 
ArgArgThr SerProGly GlnlleSerThr LeuArgVal AanlleThr AlaProLeuSer 

1561 CAGCGC7ACC GCGTCCGCA7 CCGCTACGCC AGCACCACCA ACC7GCAGT7 CCACACCAGC 
GlaArgTyr ArgValArg IleArgTyrAla SerThrThr AanLeuGln PheHiaThrSer 

1621 ATCGACGGCC GCCCCATCAA CCAGGGCAAC TTCAGCGCCA CCATGAGCAG CGGCAGCAAC 
IleAapGIy ArgProIle AsnGlnGlyAan PheSerAla ThrMetSer SerGlySerAsn 

1681 C7GCAGAGCG GCAGCTTCCG CACCGTGGGC 77CACCACCC CC77CAAC77 CAGCAACGGC 
LeuGlnSer GlySerPhe ArgThrValGly PheThrThr ProPheAan PheSerAanGly 

1741 AGCAGCG7GT TCACCCTGAG CGCCCACGTG TTCAACAGCG GCAACGAGGT G7ACATCGAC • 
SerSerVal PheThrLeu SerAlaHiaVal PheAanSer GlyAanGlu ValTyrlleAsp 

1801 CGCATCGAGT 7CGTGCCCGC CGAGG7GACC 77CGAGGCCG AG7ACGACC7 GGAGAGGGC7 
ArglleGiu PheVaJLPro AlaGluVal7hr PheGluAla GlutyrAap LeuGiuArgAla 

1861 CAGAAGGCCG 7GAACGAGC7 GTTCACCAGC AGCAACCAGA 7CGGCCTGAA GACCGACG7G 
GlnLysAla ValAsnGlu LeuPheTtirSer SerAanGln IleGlyLeu LysThrAapVal 

1921 ACCGAC7ACC ACATCGATCA AG7A7CCAA7 77AGTTGAG7 G777A7C7GA 7GAATTTTGT 
7hrAapTyr HialleAap GlnValSerAan LeuValGlu CyaLeuSer AapGluPheCya 

1981 C7GGA7GAAA AAAAAGAATT GTCCGAGAAA G7CAAACA7G CGAAGCGAC7 7AG7GATGAG 
LeuAapGIu LyaLyaGIu LeuSerGluLya ValLysHis AlaLyaArg LeuSerAapGlu 

2041 CGGAA777AC 77CAAGA7CC AAAC77TAGA GGGA7CAA7A GACAAC7AGA CCGTGGC7GG 
ArgAanLeu LeuGinAap ProAanPheArg GlylieAan ArgGlnLeu AapArgGlyTrp 

2101 AGAGGAAG7A CGGA7A77AC CA7CCAAGGA GGCGATGACG 7AT7CAAAGA GAATTACGTT 
ArgGlySer ThrAapIle ThrlleGlnGly GlyAapAap ValPheLya GluAanTyrVal 

2161 ACGC7ATTGG G7ACC777GA 7GAG7GC7AT CCAACG7A77 7A7A7CAAAA AA7AGATGAG 
7hrLeuLeu GiyThrPhe AapGluCyaTyr ProThrTyr LeuTyrGln LyalleAapGlu 

2221 7CGAAA7TAA AAGCC7ATAC CCG77ACCAA 77AAGAGGG7 A7A7CGAAGA 7AGTCAAGAC 
SerLyaLeu LyaAlaTyr 7hrArgTyrGln LeuArgGly TyrlieGIu AapSerGlnAsp 

2281 77AGAAATC7 A777AAT7CG C7ACAA7GCC AAACACGAAA CAGTAAATGT GCCAGGTACG 
LeuGIuIIe 7yrLeuIIe ArgTyrAsnAIa LyaHiaGlu 7hrVaiAan ValProGlyThr 

2341 GG77CC77AT GGCCGC777C AGCCCCAAG7 CCAA7CGGAA AA7G7GGGGA GCCGAATCGA 
GlySerLeu 7r?ProLeu SerAlaProSer ProIIeGIy LyaCyaGly GluProAanArg 

2401 7GCGC7CCGC ACC7GGAG7G GAACCCGGAC C7AGAC7GCA GC7GCAGGGA CGGGGAGAAG 
CyaAlaPro HiaLeuGlu 7rpAsnProAap LeuAapCya SerCyaArg AapGlyGluLya 

24 61 7GCGCCCA7C AT7CCCATCA 7TTC7CC77G GACA77GA7G T73GATGTAC AGACT7AAA7 
CyaAiaHia HiaSerHia HiaPheSerLeu AapIleAap ValGlyCys 7hrAapLeuAan 

2521 GAGGAC77AG GTG7A7GGG7 GATA77CAAG A77AAGACGC AAGA7GGCCA 7GCAAGACTA 
GluAapLeu GXyVal7rp ValllePheLya Ilel-y37hr GlnAapGly HiaAlaArgLeu 

2581 GGAAA7CTAG AA777CTCGA AGAGAAACCA 77AG7AGGAG AAGCAC7AGC 7CG7GTGAAA 
GlyAanLeu GIuPheLeu GluGIuLyaPro LeuValGly GluAlaLeu AlaArgValLya 
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2641 AGAGCGGAGA AAAAA7GGAG AGACAAACG7 GAAAAA77GG AATGGGAAAC AAATATTGTT 
ArgAlaGlu LyaLyaTrp ArgAapLyaArg GluLyaLeu GluTrpGlu ThrAanlleVal 

2701 7A7AAAGAGG CAAAAGAATC TG7AGA7GC7 TTATTTGTAA AC7CTCAA7A TGATAGATTA 
TyrLysGlu AlaLyaGlu SerValAapAla LeuPheVal AanSerGln TyrAapArgLeu 

2761 CAAGCGGATA CCAACATCGC GATGATTCAT GCGGCAGATA AACGCGTTCA TAGCATTCGA 
GlnAlaAap ThrAanlle AlaMetlleHia AlaAlaAap LyaArgVal HiaSerlleArg 

2821 GAAGC77A7C 7GCC7GAGCT G7C7G7GA77 CCGGGTG7CA ATGCGGCTAT TTTTGAAGAA 
GluAiaTyr LeuProGlu LeuSerVallle ProGlyVal AsnAiaAla IlePheGluGlu 

2881 TTAGAAGGGC G7ATTTTCAC 7GCA7TC7CC C7A7A7GA7G CGAGAAATGT CATTAAAAAT 
LeuGluGly ArgllePhe ThrAlaPheSer LeuTyrAap AlaArgAan VaXIleLyaAan 

2941 GG7GA7777A A7AA7GGC77 ATCCTGC7GG AACGTGAAAG GGCATGTAGA TGTAGAAGAA 
GlyAapPhe AanAanGly LeuSerCyaTrp AanValJLya GlyHiaVal AapVaiGluGlu 

3001 CAAAACAACC ACCGTTCGGT CCTTGTTG7T CCGGAATGGG AAGCAGAAGT G7CACAAGAA 
GlnAanAan HiaArgSer ValLeuValVal ProGluTrp GluAlaGlu ValSerGlnGlu 

3061 GTTCGTGTCT GTCCGGGTCG TGGCTATATC C77CG7GTCA CAGCGTACAA GGAGGGATAT 
ValArgVal CysProGly ArgGIyTyrlle LeuArgVal ThrAlaTyr LyaGluGlyTyr 

3121 GGAGAAGGTT GCGTAACCAT TCATGAGATC GAGAACAATA CAGACGAACT GAAGTTTAGC 
GlyGluGly CyaValThr XleHiaGluIle GluAanAan ThrAapGlu LeuLyaPheSer 

3181 AAC7G7G7AG AAGAGGAAGT ATATCCAAAC AACACGGTAA CGTGTAATGA TTATACTGCG 
AanCyaVal GluGluGlu ValTyrProAan AsnThrVal ThrCyaAan AapTyrThrAla 

3241 AC7CAAGAAG AATATGAGGG TACGTACACT TC7CG7AA7C GAGGATATGA CGGAGCCTAT 
ThrGlnGlu GluTyrGlu GlyThrTyrThr SerArgAan ArgGlyTyr AapGlyAiaTyr 

3301 GAAAGCAATT CT7C7G7ACC AGCTGATTAT GCA7CAGCC7 A7GAAGAAAA AGCA7ATACA 
GluSerAan SerSerVal ProAlaAapTyr AlaSerAla TyrGluGlu LyaAlaTyrThr 

3361 GA7GGACGAA GAGACAATCC 77GTGAA7C7 AACAGAGGA7 A7GGGGA7TA CACACCACTA 
AapGlyArg ArgAapAan ProCyaGluSer AanArgGly 7yrGlyAap TyrThrProLeu 

3421 CCAGC7GGC7 ATG7GACAAA AGAATTAGAG 7AC77CCCAG AAACCGA7AA GGTATGGATT 
ProAlaGly TyrValThr LyaGluLeuGlu 7yrPhePro GluThrAap LyaValTrpIle 

3481 GAGATCGGAG AAACGGAAGG AACA77CA7C G7GGACAGCG 7GGAA77ACT TCTTATGGAG 
GluIleGly GluThrGlu GlyThrPhelle ValAapSer ValGluLeu LeuLeuMetGlu 

3541 GAA7AA 
Glu 
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